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Nonrandom Sources of Variation in Vowel Quality 


WILLIAM R. TIFFANY 


It is well known that repeated utter- 
ances of the same vowel phoneme by a 
speaker or speakers will result in varia- 
tions in observed phonetic quality as 
well as in measurable acoustic charac- 
teristics. Whereas much of this varia- 
tion may be of an essentially random 
nature, certain nonrandom sources of 
variation have been discussed. It is well 
known, of course, that the phonetic 
context will influence the nature of the 
vowel. This was pointed out by Black 
(1) and others early in the history of 
acoustic phonetics. Even where pho- 
netic context is held constant, however, 
and when the allophone is the unit 
studied, important inter- and intra- 
subject variations have been observed 
(2, 3, 4). 

It is generally acknowledged that the 
production of vowel sounds by an indi- 
vidual is not a random process and that 
the values of the acoustic measure- 
ments of the vowels are therefore not 
distributed in random order. The data 
found by Peterson and Barney (2) 
were said to reveal that ‘both the pro- 
duction and the identification of vowel 
sounds by an individual depend on his 
previous language experience.’ 

In their reference to the ‘previous 
language experience’ of the individual, 
Peterson and Barney were primarily 
concerned with the dialect influences, 
but did not specifically exclude other 
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aspects of language experience which 
might not normally be thought of as 
dialectal in nature. 

Among the possible sources of non- 
dialectal, nonrandom variation in a 
single speaker’s repeated utterances of 
an allophone, or among the utterances 
of many speakers, at least one has al- 
ready been the subject of study. The 
influences of sex and age, as they gov- 
ern resonator size, are well known. The 
data of Potter and Steinberg (4) and 
others make clear the relative nature 
of the formant patterns and appear to 
show conclusively that although form- 
ant ratios remain fairly stable the 
formant positions vary systematically 
with age and sex. 

Except for normative studies which 
have contributed information concern- 
ing sex and age differences and studies 
which have investigated the language 
variable, there has been little attempt to 
identify the sources of nonrandom vari- 
ations in vowel quality among and 
within subjects. 

One possible source of variation in 
vowel quality which has not as yet 
been systematically studied is that con- 
tributed by speech training, or, what 
is presumed to be a result of this, speech 
excellence. Although it is difficult at 
first hand to see how this is a variable 
which is distinct from that of speech 
experience or dialect, an a priori case 
can be made for such a position. 

To see how such a case might be 
made it is first necessary to admit a 
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factor of vowel intelligibility which is 
dependent upon the degree to which 
the several vowels of a given vowel sys- 
tem are mutually distinctive. High in- 
telligibility might then be considered 
some function of the number of dif- 
ference limens separating the vowels, 
which would in turn depend upon the 
separateness of the main resonance re- 
gions of the contrasted vowels. In terms 
of the first formant and the second 
formant as plotted on the familiar 
acoustic vowel diagram, the posited 
vowel intelligibility would thus be a 
function of the size of the vowel dia- 
gram, as opposed to its shape. In this 
view the larger the vowel diagram the 
more intelligible would be the speech. 
It seems possible that this variable of 
vowel diagram size would be in some 
degree independent of differences con- 
ferred by a dialectal factor. It would 
seem not unreasonable to suppose that 
increased vowel intelligibility, as de- 
fined by vowel diagram size, would be 
associated with speech excellence or 
with speech training. 

Another possible source of non- 
random variation in vowel utterance, 
which has as yet received no systematic 
study, is the kind of stress which com- 
monly results from forcefulness of 
utterance. Previous studies have shown 
that there is reason to believe that the 
several factors which enter into stress, 
such as length, fundamental pitch 
change, and intensity, are normally in- 
dependent of vowel formant position, 
but it would seem possible nevertheless 
that such stress might be accompanied, 
in certain lawful ways, by concomitant 
changes in resonance. That is, it is con- 
ceivable that a part of the stress of a 
vowel might result from a typical alter- 
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ation in its quality. Can a vowel be 
stressed by making it more intelligible? 
For example, is a stressed [i] more 
‘[i]-ish’ than an unstressed [i]? There 
is, of course, much good experience 
and evidence to show that this indeed 
is the case, and there is recognition of 
the obscuring of the vowel which oc- 
curs when a syllable loses its accent. 
Kenyon (5, p.90) points this out when 
he states that ‘It is characteristic of Eng- 
lish . . . that the vowels of unaccented 
syllables have gradually obscured to a 
sound quite different in resonance. . . .’ 
The degree and the direction in which 
this obscuring operates, the nature of 
such obscuring, and the presence of ob- 
scuring in normal variations in sense 
stress (as opposed to syllabic stress) 
have not been investigated systemati- 
cally as a function of formant frequency 
location. 

It was the purpose of this investiga- 
tion to assess the influence on vowel 
resonances of two factors: (a) the fac- 
tor of speech excellence, as this might 


* be assumed to be a result of speech 


training and experience, and (b) the 
factor of sense stress. In addition, a 
third factor, that of the presence or 
absence of a consonant context, also 
was investigated, for what light it might 
throw not only on differences between 
trained and untrained speakers, but also 
on the general question of the so-called 
isolated phoneme. 

Put in more concrete terms, this 
study makes an attempt to determine 
what, if any, significant acoustic dif- 
ferences might be found (a) between 
the vowel utterances of trained speak- 
ers and a matched group of untrained 
speakers of the same dialect charac- 
teristics, (b) between stressed and un- 
stressed vowels in the same context, 
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and (c) between vowels in context and 
vowels in isolation. 

Not only was it hoped that data 
could be provided pertinent to the 
problems of articulation improvement, 
but also it was hoped that light could 
be thrown on other problems con- 
cerned with the stability of the vowel 
resonance patterns. For example, do 
the patterns, as represented by the 
acoustic vowel diagram, vary in cer- 
tain predictable ways among and within 
speakers of the same language habits? 
Can vowel intelligibility be thought of 
as related to the general size of the 
vowel diagram, as opposed to its shape? 
Are stressed vowels merely louder, 
longer, and higher? Or do they char- 
acteristically occupy a different posi- 
tion on the vowel diagram? Are better 
speakers better because of prosodic fac- 
tors, semantic factors, and consonant 
articulation factors only, or do they 
perhaps also use vowel sounds which 
are less variable or of a different reso- 
nance? What relation does the quality 
of the isolated vowel phonation have 
to the quality of the vowel in context? 
These are some of the more specific 
questions which this research seeks to 
answer. 


Experimental Method 


The design of this experiment in- 
volved the measurement of the acoustic 
characteristics of each of 10 vowels 
li], [1], fe], fe], fe], [o], [ev], [ul, 
[a], and [a] and two diphthongs [ar] 
and [au] under three different condi- 
tions of speaking for two different 
groups of adult General American 
speakers. 


Speaking Conditions. There were 
two different conditions of sense stress 


for the 12 different vowel and conso- 
nant sounds in a [h__d] ccntext. Con- 
trastive sense stressing was obtained 
by devising six more or less sensible 
sentences, each of which contained 
two different [hd] words. These 
sentences were presented to the reader 
in pairs, with different words under- 
lined in each member of the pair, in 
such a way that if read properly the 
two readings would produce both a 
stressed and an unstressed reading of 
the same word .in the same context. 
One such pair was: 


They hid under the hood of the car. 
They hid under the hood of the car. 








The instructions to the readers were, 
‘Read the following in a natural tone, 
but making it very clear that the mean- 
ing differs in each pair according to 
the word stressed (indicated by the 
underlining).’ The six sentences em- 
ployed in the readings were: 

a. But he’d never had the warning. 





ae 


b. I don’t know who'd have thought 
to hide there. 

c. They hid under the hood of the 
car. 


d. Now how’d you like to be the 
head boss. 


e. The farmer hayed the field and 
hoed the potatoes. 


f. The old hod belonged to Mr. 
Hud. on; 

















This technique produced for analy- 
sis 12 vowels with phonetic and sen- 
tence contexts constant, but with two 
different conditions of sense stress. No 
instructions were given as to how stress- 
ing was to be carried out. Nearly all 
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speakers, trained and untrained, were 
able to carry out the instructions with- 
out difficulty. In the case of obvious 
errors the sentence was re-recorded on 
the spot under the same conditions and 
instructions. 


A third condition of speaking was 
that of the isolated vowel phonation. 
In this condition, which followed the 
reading of the sentences, subjects were 
presented with a list of the 12 [hd] 
words out of context and were told, 
‘For each of the following words read 
only the vowel sound.’ The trained 
speakers had no difficulty doing this. 
Most of the untrained speakers also 
found it simple to follow these direc- 
tions. Two of the speakers had to be 
trained, however. This training con- 
sisted simply of saying, ‘No, what I 
mean is say the vowel all by itself. For 
example, for the word [hsd] you 
would say only [3‘].’ After a few trials 
this was sufficient in all instances. No 
examples were given, of course, of the 
sounds under study. 


Speakers. The trained-speaker group, 
10 males, of this study was chosen from 
among the faculty and advanced grad- 
uate students of the Department of 
Speech, University of Washington. Al- 
though these people were not all of 
high artistic attainment or of beautiful 
voice quality, they demonstrated good, 
clear, articulate speech. They were all 
of the General American dialect and 
appeared to the experimenter to dem- 
onstrate no dialect peculiarities in their 
speech—at least insofar as the vowel 
enunciation was concerned. 

The untrained speakers, 10 males, 
were students in a beginning speech 
course. They had as yet received no 
significant amount of formal phonetic 


training, and were all judged by the 
experimenter to be ‘indifferent’ to 
‘poor’ with regard to what might be 
termed general articulation accuracy. 
All spoke with a General American 
dialect, however, and in this respect 
were not judged (by the experimenter) 
to be in any significant degree different 
from the trained speakers. In physical 
size they were approximately matched 
to the trained speakers, although there 
was no formal control placed upon this 
factor. The two groups of speakers dif- 
fered most obviously from one another 
in amount of experience and training. 


Analysis of Recordings. All phona- 
tions were recorded on an Ampex-600 
tape recorder in a relatively dead quiet 
room. These recordings were then 
dubbed onto the recording tape of the 
Kay Electric Company Sonagraph for 
analysis. 

An attempt was made to equate all 
vowels dubbed onto the Sonagraph re- 
corder in terms of overall intensity as 


. this may be determined from a VU 


meter. The vowels were dubbed at a 
level which would result in a —5 VU 
reading when the slow speed playback 
was monitored, regardless of whether 
the vowel was stressed, unstressed, or 
isolated. Dubbing was thus accom- 
plished with considerable monitoring 
of the recording volume inasmuch as 
stressed vowels were, of course, con- 
siderably more intense than the un- 
stressed. 


Acoustic Measures. For each vowel 
spectrogram four acoustic measures 
were obtained, in the following way: 

(a) A duration measure was made 
directly from the spectrogram. In gen- 
eral the duration of the vowel was de- 
fined as the duration of at least two 
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clear-cut formants in the appropriate 
frequency regions. 

(b) A measure of the fundamental 
frequency was made by averaging the 
number of vertical striations of the 
broad-band spectrogram over a_ .05- 
sec interval in the mid-portion of the 
vowel. This, of course, substituted a 
single frequency measurement, pre- 
sumed to be representative of that 
vowel because taken in its mid-portion, 
for the complex pattern of frequency 
change. 

(c) and (d) Measures of the frequen- 
cies of formants one and two were 
made directly from the broad-band 
spectrogram at a point one-fourth the 
duration of the whole vowel. This 
point was chosen somewhat arbitrarily 
as representative of the whole vowel 
because perhaps least affected by onset 
or terminating influences. 

For each of the two distinctive diph- 
thongs a somewhat different method of 
measurement was used for formants 
one and two. Two sets of measures 
were made: one of the formant posi- 
tions at the very onset of the diphthong 
and a second at its termination. 


Results 


Tables 1 and 2 present the basic data 
of this experiment in terms of mean 
measurements for 10 subjects in each 
of two groups. Table 1 gives mean data 
for the duration and fundamental fre- 
quency measures. Table 2 gives both 
means and standard errors for the first 
formant and second formant measures, 
for trained and untrained speakers, and 
for the three conditions of phonation 
(stressed, unstressed, isolated). 

Tables 3, 4, and 5 present a summary 
of the only statistical analyses thus far 


attempted. Although it may appear 
from Table 2 that all formant data 
could be subjected to an analysis of 
variance, several reasons, not excluding 
the inordinate amount of time involved, 
argue forcefully against this procedure. 
Tables 3, 4, and 5 present, instead, the 
difference scores obtained between cer- 
tain pairs of formant means, with dif- 
ferences significant at or beyond the 
5% level, as determined by a ¢ test, 
indicated. Table 3 presents the differ- 
ence scores comparing the trained with 
the untrained speakers. Table 4 com- 
pares stressed with unstressed phona- 
tions, and Table 5 compares the stressed 
with the isolated phonations. 

It can be hypothesized that in the 
process of articulation it is the relation- 


Taste 1. Duration (in seconds) and fre- 
quency (in cycles per second) means for 
each of 10 vowels phonated by 10 trained 
and 10 untrained speakers. Frequency meas- 
ures represent an average frequency over .05 
sec taken from the center of the vowel 
phonation. 





Vowel Stressed Unstressed 
Trained Untrained Trained Untrained 





(Duration) 








i] 163 159 096 123 
1] 168 154 128 124 
e] 182 .180 143 132 
e] 107 124 -100 -107 
2] Fy 159 123 129 
o| 184 189 131 156 
u| 147 oh 35 092 101 
ul okZ3 178 119 All 
al 281 247 266 220 
Al 188 170 ok 77 156 
Mean 174 170 138 136 
(Frequency) 

i] 193 187 126 137 
1] 162 175 121 132 
e] 183 175 132 146 
e] 171 169 120 121 
2] 204 186 135 134 
0} 152 154 134 138 
u| 177 176 106 127 
ul 184 206 113 134 
al 121 153 83 111 
A] 145 138 82 123 
Mean 169 172 115 130 
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Taste 2. Means and standard errors for the first two formant positions for 10 trained and 


10 untrained speakers. 








Vowel Untrained Trained 

Unstressed Stressed Isolated Unstressed Stressed Isolated 

M SE M SE M SE M SE M SE M SE 
(Formant 1) 
{i] 395 20 400 15 350 10 375 13 395 9 300 7 
[1] 480 13 505 16 485 15 440 16 515 13 450 15 
[e] 520 =20 510 16 560 14 505 14 500 13 495 16 
[e] 595 17 615 15 600 21 590 14 620 13 615 20 
[x] 710 18 760 21 745 22 680 24 725 26 720 23 
[o] 545 14 530 20 555 23 540 10 455 20 495 26 
[u] 560 16 570 11 540 16 505 9 530 15 480 17 
[ul] 450 18 485 15 435 21 410 14 440 24 360 12 
a] 700 10 725 20 705 16 750 17 750 18 700 11 
[a] 680 17 740 19 680 21 675 15 730 19 680 8 
Jar] 730 =20 785 36 805 28 700 31 810 46 775 21 
[ar] 435 27 415 26 475 42 415 24 385 25 405 31 
fav] 785 26 805 25 785 18 720 31 733 26 850 15 
[au] 690 44 660 26 470 = 48 700 27 656 30 340 37 
(Formant 2) 
[i] 2170 §=66 2345 41 2305 36 2255 35 2355 27 2450 41 
{1] 2075. 47 2215 22 2095 56 2155 42 2180 24 2150 36 
[e] 2150 38622 2175 25 2105 59 2190 50 2250 31 2185 58 
[fe] 1930 = 2 1960 47 1970 54 1900 50 1985 42 2025 56 
[x] 1770 19 1845 39 1830 42 1785 27 1865 43 1955 55 
[o] 935 36 940 54 935 55 900 15 805 32 765 18 
[vu] 1300 = 48 1275 40 1210 46 1210 34 1140 33 1110 52 
Ju] 935 55 1025 34 1056 = 49 922 40 930 30 765 29 
Ja] 1125 40 1150 33 1145 35 1205 17 1235 25 1110 47 
[a] 1380 = 60 1370 37 1315 40 1385 22 1370 15 1265 33 
Jar] 1335 36 1230 28 1295 31 1340 46 1275 46 1265 25 
[at] 1920 §=40 2045 50 2135 74 1995 48 2100 53 2310 48 
[av] 1320 0 3=—55 1300 30 1345 60 1295 32 1300 41 1350 32 
[au] 1210 3663 1090 54 695 67 1220 50 1144 38 525 51 





ship among the resonances of the sev- 
eral vowels which determines rec- 
ognition, rather than the resonance 
frequency per se. Figures 1 and 2 are 
presented to show these relationships 
graphically on a conventional acoustic 
vowel diagram with the first formant 
plotted against the second formant and 
with coordinates arranged according to 
the mel scale. In Figure 1 the formant 
coordinates for the three conditions are 
compared for trained speakers only. 
Figure 2 gives the same data for the 
untrained speakers. 


Frequency and Duration. The fre- 
quency and duration measures of Table 
1 indicate clearly that the stressed pho- 
nations were indeed given more promi- 
nence by the speakers, at least insofar 
as the measures employed here are con- 
cerned. Duration averages for stressed 
vowels were about .17 sec and for un- 
stressed less than .14 sec. The differ- 
ences are clearly statistically significant. 
It is also clear that stressed vowels were 
significantly higher in pitch than the 
unstressed, with stressed vowels aver- 
aging about 170 cps and unstressed 
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Taste 3. Mean difference scores* showing the comparison between trained and untrained 
speakers on the basis of formant frequency measures, all measures in cycles per second. 








Vowel Stressed Unstressed 
Formant 1 Formant 2 Formant 1 Formant 2 

{i] ¥ —10 20 —85 
(1] —10 35 40 —80 
[e] 10 —75 15 —40 
[e] -5 —25 5 30 
{z] 35 —20 30 —15 
[o]} 757 135+ 5 35 
[vu] 40f 135+ 55+ 90 
{uj 45 957 40 13 
[a] —25 —857 —507 —80 
[a] 10 0 5 — 5 
[at] —25 —45 "30 —5 
[ar] 30 —55 20 —75 
[au] 72 0 65 25 
{av] 4 —54 —10 —10 





“Difference scores are obtained by substracting the mean for the 10 trained speakers from the 


mean for the 10 untrained. 


{Statistically significant at or beyond the 5% level. 


about 122 cps. Such findings are, of 
course, in agreement with many previ- 
ous studies on the acoustical nature of 
accent. 

On the whole, pitch and duration 


trends were similar for both trained 
and untrained subjects. One interesting 
pitch difference, however, is clearly 
evident from Table 1. It will be noted 
that whereas both groups used about 


Taste 4. Mean difference scores* showing the comparison between stressed and unstressed 
vowels on the basis of formant frequency measures, all measures in cycles per second. 








Vowel 4 Untrained Speakers Trained Speakers 
Formant 1 Formant 2 Formant 1 Formant 2 

[i] 5 175+ 20 100+ 
{1] 25 1407 754 25 
{e] —10 25 -5 60 
[e] 20 30 30 85+ 
[2] 507 75 45+ 807 
[o] —15 5 —857 —95+ 
[v] 10 = 25 —70 
[u] 354 90 30 8 
[a] 25 25 0 30 
[a] 40t -10 55+ -15 
[at] 55 -105F 1107 —657 
[ar] —20 125 —30 105+ 
[av] 20 —20 13 5 
[av] —30 120} -4++ —76 





*Difference scores are obtained by subtracting the mean for the unstressed vowel from the 


mean for the stressed. 


{Statistically significant at or beyond the 5% level. 
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Taste 5. Mean difference scores* showing the comparison between stressed and isolated 
vowels on the basis of formant frequency measures, all measures in cycles per second. 








Vowel Untrained Speakers Trained Speakers 
Formant 1 Formant 2 Formant 1 Formant 2 

[i] 50t 40 957 —957 
[1] 20 120 657 30 
{e] —507 70 5 65 
[e] 15 —10 5 —40 
{z] 15 15 5 —90 
[o] —25 5; —40 40 
[v] 30 65 50 30 
[u] 50 -—31 807 165+ 
[a] 20 5 50 1257 
[a] 607 55 507 105+ 
[ar] —20 —657 35 10 
[ar] —60 —90 —20 —210 
fav] 20 —45 —117} —50 
[av] 190} 395+ 316} 619+ 





*Difference scores are obtained by subtracting the mean for the isolated vowel from the 


mean for the stressed. 


{Statistically significant at or beyond the 5% level. 


the same pitch for the stressed vowels 
the trained speakers were consistently 
lower in pitch on the unstressed. This 
might suggest that appropriate unstress- 
ing is more a mark of the successful 
speaker than is appropriate stressing. 
Formant Location as a Function of 
Speech Training. As Tables 2 and 3 
show, there are some indications that 
speech training is a factor influencing 
vowel quality, although there are obvi- 
ous and complex interactions between 
this factor and the vowels and stresses. 
The data in Table 2 seem to indicate 
some slight tendency for consistency 
and training to go together, at least as 
that consistency is measured by an 
among-speakers variability. If only the 
context vowels are taken into account, 
then out of a total of 56 standard exror 
measurements (14 vowels and diph- 
thongs, 2 stresses, 2 formants), the un- 
trained speakers show the greater vari- 
ation in 33 cases, the trained in 23. This 
is a result which presumably could be 


due to chance alone about once out of 
four times. The trend is clearest in the 
case of the second formants for the 
‘pure’ vowels, where 15 out of 20 meas- 
ures are less variable for the trained 
speakers than for the untrained. 


As indicated in Table 3, mean differ- 
ences between trained and untrained 
speakers show slight tendencies for 
vowel quality to vary in consistent and 
lawful fashion. Differences between 
groups are significant for four of the 
stressed vowels and two of the un- 
stressed. In general, first formant posi- 
tions were lower for the trained speak- 
ers. The most obvious exception is in 
the case of [a]. Of 28 first formant 
differences there are only six exceptions 
to the rule of a lower first formant for 
trained speakers than for untrained. 
Second formant measures give a differ- 
ent picture. There is a tendency for 
the back vowels (except for [a]) of 
trained speakers to have lower second 
formants, while front vowels are, for 
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the most part, higher in second formant 
postition (8 out of 10 measures). All of 
the statistically significant differences 
are for the back vowels. 

The possible significance of the above 
differences is more evident on compari- 
son of Figures 1 and 2, in which the 
interaction between stressing, training, 
and formant ratios can be better evalu- 
ated. The comparisons are highly com- 
plex and at this point the reader is left 
to formulate his own hypotheses con- 
cerning the differences between Figures 
1 and 2. These figures will be discussed 
in the following paragraphs. 


l'ormant Location as a Function of 
Condition of Phonation (Stressed, Un- 
stressed, Isolated). There does not ap- 
pear to be any well defined tendency 
for variability of formant position and 
stress to go together. A slight tendency 
does exist for the second formant meas- 
ures to be more variable in unstressed 
vowels. However, this tendency is 
somewhat weak and in only a very few 





cases do the differences appear to ap- 
proach statistical significance. 


The relationship between formant 
frequency means and stress is found to 
be very complex and to involve impor- 
tant interactions between stress and 
training. There are, however, certain 
interesting trends to be observed in the 
data of Tables 2 and 4. Of the 40 for- 
mant differences obtained between 
stressed and unstressed measures, 31 
show a higher formant position for the 
stressed phonation. Of all of the sig- 
nificant differences found—of which 
there were 13 for the vowels—only two 
(both for [o0]) showed a reversal of 
this trend. Of all of the vowels meas- 
ured only [e], [uv], and [a] failed to 
show some significant differences, for 
either the trained or the untrained 
groups, for either the first formant or 
the second formant. There appears to 
be a tendency for stressing differences 
to be greater for the trained than for 
the untrained speakers. 
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FORMANT TWO 


Ficure 1. Acoustic vowel diagram for isolated, stressed, and unstressed vowels as produced by 
the trained speakers. Coordinates are based upon the average of 10 speakers. The scales are 
mel scales, with a mid-section compressed for the formant two scale. 
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Changes of considerable magnitude 
are also apparent for the diphthongs. 
Second formant changes were large 
and significant for both the beginning 
and ending of [at] for both trained 
and untrained speakers and for the end- 
ing of [au] for untrained speakers. 

Table 5 shows the magnitude and 
the significance of differences between 
the stressed and isolated phonations. 
Tendencies are seen to vary markedly 
but to be generally in the direction of 
lower formants for the isolated vowels 
(32 out of 40 comparisons). There are 
slight tendencies to find greater differ- 
ences for trained speakers than for un- 
trained speakers in this respect. Differ- 
ences are particularly distinctive for the 
high front vowels, the low back vowels, 
and the diphthongs. Attention should 
be drawn especially to the very large 
differences between the context and 
the isolated [av]—pronounced as a 
rather extreme diphthong in isolation 
but almost as a vowel [a:] in speech. 





Figures 1 and 2 present the complex 
data of the tables in a somewhat more 
comprehensible form, on a vowel dia- 
gram with the first formant plotted 
against the second formant. They are 
offered here without any attempt at 
statistical analysis, but as a means 
whereby the reader may form his own 
hypotheses regarding the influence of 
stress and training on vowel quality. 
There does appear in these figures a 
tendency for the vowel diagram to 
grow smaller from isolated vowel to 
stressed vowel to unstressed vowel. 
Vowels seem to move toward a neutral, 
or at least a central, point on the vowel 
diagram as they lose energy in con- 
text. This is a somewhat weak trend, 
of course, but it may be seen operating 
to some extent for the [i], [e], [ez], 
{u] and [o]. The trend is somewhat 
less obvious for [a] and [a], as one 
might expect, and is to some extent 
reversed for [1] and [e]. Incomplete 
as the data are it does appear possible 
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FORMANT TWO 
Ficure 2. Acoustic vowel diagram for isolated, stressed, and unstressed vowels as produced 
by the untrained speakers. Coordinates are based upon the average of 10 speakers. The scales 
are mel scales, with a mid-section compressed for the formant two scale. 
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to hypothesize that for trained speakers, 
at least, stressing and context influence 
vowel diagram size, and probably more 
than they influence vowel diagram 
shape. 

Figure 2, the diagram for the un- 
trained speakers, by no means shows 
the same extent of diagram shift as does 
Figure 1. To be sure, when only 
stressed and unstressed vowels (in con- 
text) are examined the same tendency 
is observed, though with less strength 
({a], [o] and [u] are apparent rever- 
sals). The isolated phonations appear 
not to have exactly the same relation- 
ship to the context phonations as they 
did for the trained speakers. For the 
trained speakers the isolated phonation 
seems more often to appear as a kind 
of exaggeration of the normal vowel 
quality, whereas for the untrained 
speakers it appears more as merely a 
kind of representative of the average 
vowel quality. 


Discussion 


It was the stated purpose of this 
study to assess the influence on vowel 
resonance of speech excellence, sense 
stress, and vowel isolation. Significant 
differences were found, for some vow- 
els at least, indicating that these factors 
may be of importance in determining 
the precise vowel color for any given 
context. Although differences were 
complex and contradictory in some 
ways it does seem possible to hypothe- 
size a kind of vowel intelligibility which 
may be related to the size, as opposed 
to the position or shape, of the vowel 
diagram. With regard to the questions 
asked in the introduction, stressed 
vowels may not be merely longer, 
higher, and louder than unstressed. 
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They may also be different in terms 
of vowel resonance patterns as well. 
The differences, moreover, appear to 
be differences of pattern, involving 
the vowel diagram as a whole, such 
that a stressed vowel, for example an 
[i], is not only longer, louder, and 
higher, but is also more ‘[i]-ish’ than 
an unstressed vowel. 

It may be possible now to fit the 
present data into a three-part hypothe- 
sis which would account for most of 
the nonrandom -sources of variation in 
vowel recognition. In terms of a bi- 
dimensional model, the vowel system 
can be conceptualized as a series of 
first formant (F1), second formant 
(F2) coordinates. Vowel recognition, 
in this system, may be thought of as 
a process of discriminating among these 
points. This job of discrimination will 
be aided by the fact that for any one 
person repeating a single allophone un- 
der constant conditions, variations of 
these points will be essentially random 
in nature and within rather well de- 
fined limits. 

Speech recognition depends, how- 
ever, upon ability to understand more 
than a single person and condition and 
therefore upon ability to adapt to 
other than random sources of variation. 
In two dimensional terms this and pre- 
vious research indicate that recognition 
may depend upon ability to adapt to at 
least three kinds of nonrandom varia- 
tion, as illustrated in Figure 3. 

First illustrated, in Figure 3A, is the 
situation in which the only sources of 
variation would be random. The ex- 
tent of such variation would determine 
the relative reliability of the coordi- 
nates which fix the corners of diagram 
A, as indicated by the large dotted 
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Ficure 3. Graphic illustration of four kinds of vowel variation given in terms of the acoustic 
formant one, formant two vowel diagram. Ordinate is formant one; abscissa is formant two. 


circles, but it would not determine the 
relationship among the circles. 

The second kind of vowel variation, 
illustrated in 3B, is that of dialect, per- 
haps the most difficult kind to which 
the listener must adapt. The figure 
illustrates the vowel coordinates and 
relationships for two different (hypo- 
thetical) languages or dialects. 

The third kind of vowel variation is 
suggested in 3C which illustrates the 
variation encountered between adults 
and children, or as a result of changes 
in overall resonator size. In this case 
the vowel relationships are relatively 
constant but the positions of the vowel 
diagrams within the F1-F2 space are 
shifted. Success in understanding the 
speech of both men and children testi- 
fies to ability to make such an adapta- 
tion; that is, as long as vowel diagram 


shape remains constant, its position may 
change, within limits, without inter- 
fering with recognition. The limits of 


“such change have not yet been well 


defined. 

The fourth kind of variation is that 
which has to some extent been demon- 
strated in the present study. This is 
illustrated in Figure 3D. Here the vowel 
diagram shape remains the same and 
the location in the F1-F2 space is 
roughly. equivalent, but the size has 
changed. The tendency for the stressed 
vowels to produce a diagram which 
fills more of the available F1-F2 space, 
together with the tendency for trained 
speakers to show somewhat more ex- 
treme vowel coordinate positions when 
producing isolated vowels, would lend 
support to 3D as illustrative of an im- 
portant source of variation. 

















Sources of Variation in Vowel Quality: Tiffany 317 


One possible hypothesis arising from 
a consideration of this fourth kind of 
variation is the following: other things 
being equal, the larger the vowel dia- 
gram size, the better the vowel intel- 
ligibility. How might such an acoustic 
hypothesis be applied to speech train- 
ing problems and what would it mean 
in terms of physiological phonetics? 
Perhaps it may be said that for increased 
intelligibility the tongue should be 
higher and more forward for high 
front vowels, and lower and more to 
the back for the back vowels. Perhaps 
it should simply be said that more ex- 
treme articulatory positions are char- 
acteristic of more careful articulation 
or of stress. The present investigation 
has, of course, not yet proved the point. 

This study has, it would seem, made 
it possible to advance some hypotheses 
in the area of vowel intelligibility. It is 
left for further study, however, to de- 
termine the limits within which vowel 
diagram size may be manipulated with 
good retention of intelligibility. As a 
matter of fact the whole question of 
the limits within which the F1-F2 space 
may be manipulated with respect to 
size, position, and shape deserves fur- 
ther attention. 


Summary 


Nonrandom sources of variation in 
vowel quality were studied by means 
of acoustic measures of duration, fun- 


damental frequency, and frequencies of 
the first two formants of isolated vow- 
els and of stressed and unstressed vow- 
els in an [h—d] context for 10 trained 
and 10 untrained speakers. Although 
results indicated complex and variable 
relationships, there was some evidence 
that, in general, the differences among 
the several vowel resonances tended to 
be greater for the stressed than for the 
unstressed vowels, and greater for the 
trained than for the untrained speakers. 
There was also evidence that isolated 
vowels produced by trained speakers 
have more extreme articulatory po- 
sitions than the same vowels in context. 


A hypothesis was suggested to the 
effect that vowel intelligibility may be 
considered as a kind of function of 
vowel diagram size as well as of shape, 
location, and reliability. 
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Articulation Skills, Velopharyngeal Closure, 


and Oral Breath Pressure 


of Children with Cleft Palates 


DUANE C. 


GENE R. POWERS 


This study deals with children with 
cleft lips and palates and the relation- 
ships among their articulation skill, oral 
breath pressure, and velopharyngeal 
closure as revealed by lateral head 
xX rays. 


Oral pressure has long been recog- 
nized as one of the requisites in the 
production of speech sounds. Stetson 
and Hudgins (12), Hudgins and Stet- 
son (6), Stetson (11), and Black (1) 
have studied the amount of breath pres- 
sure in the oral cavity of normal speak- 
ers during speech. They have shown 
that oral breath pressure is greater dur- 
ing the production of fricatives and 
plosives than it is either for other types 
of consonants or for vowels. They also 
have found that the pressure is greater 
during the production of voiceless con- 
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sonants than during the production of 
their voiced cognates. 

The pattern of articulation errors of 
individuals with cleft palates is of con- 
siderable interest in view of the find- 
ings summarized above. Investigators 
such as Bzoch (2), Counihan (3), Mc- 
Williams (8), Spriestersbach and others 
(9), and Starr (10) have found that 
individuals with cleft palates misarticu- 
late with greatest frequency those 
sounds for which oral breath pressure 
is highest for normal speakers. These 
findings suggest that cleft palate speak- 
ers typically have difficulty building 
up enough air pressure in the oral cav- 
ity for the adequate production of 
speech. The most reasonable explana- 
tion for this lack of pressure is that 
there is an abnormal air leak through 
the velopharyngeal valve during artic- 
ulation. 

Films of lateral head x rays have been 
used extensively to assess the adequacy 
of velopharyngeal closure.* Although 
x-ray films provide data of major diag- 
nostic value, they deal with two spatial 
dimensions only and do not permit as- 
sessment of the volumetric dimensions 
of the oral and nasal cavities. The clini- 
cian needs to be able to augment the 


1158 from the National Institute Mental ; : . 
Health, Public Health Service. x-ray data with more direct evaluations 
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of the functional adequacy of closure. 
Measurement of oral breath pressure is 
one way to provide this functional 
assessment. 


Procedure 


The 103 subjects in the present study 
consisted of 38 girls and 65 boys rang- 
ing from five through 15 years of age. 
The mean IQ for the entire group as 
measured by the Wechsler Intelligence 
Scale for Children was 93.6. Eight of 
the group had socially-handicapping 
hearing losses, that is, an average loss 
of 25 db or greater in the better ear 
for the speech frequencies. Eight of 
the group had clefts of the lip only, 
23 had clefts of the palate only, and 72 
had clefts of both lip and palate. Of the 
95 subjects with palatal clefts, seven had 
unrepaired cleft palates, 19 were wear- 
ing obturators, and 69 had had surgical 
closure of the palate. The physical man- 
agement of the subjects had been car- 
ried out at various centers in four 
midwestern states. 

Articulation skill was measured by 
the Templin diagnostic picture articu- 
lation test. This test elicits 176 word 
responses and allows for the evaluation 
of all of the speech sounds in a variety 
of phonetic contexts. Furthermore, 
normative data are available which 
make it possible to compare an indi- 
vidual’s score on the test with the 
scores of physically normal children of 
the same age and sex. For the purposes 
of this study a ratio was computed be- 
tween the individual’s articulation score 
and the norm for his age and sex. The 
ratio was used in an attempt to mini- 
mize the effects of maturation by tak- 
ing into account the normal pattern of 
development of articulation skills. A 


ratio of 1.00 was assigned if the per- 
formance of a given subject equalled 
the norm. Thus, the lower the ratio the 
less favorable was the individual’s per- 
formance in comparison to the norm. 
This ratio will be referred to as the 
Templin ratio. The number of cor- 
rect responses for 72 pressure sounds 
(36 plosives and 36 fricatives) elicited 
during the Templin test was used as 
a second articulation measure. The spe- 
cific sounds were arbitrarily chosen 
but were uniform for all subjects. They 
were selected to sample all of the frica- 
tive and plosive sounds in the initial, 
medial, and final positions. Singles and 
blends were sampled equally for both 
fricatives and plosives. 

Two measurements of oral pressure 
also were obtained. One was a measure 
of vital capacity obtained by the use of 
a wet spirometer which had been modi- 
fied to require a minimal amount of 
oral pressure to activate the equipment. 
The vital capacity measurements were 
made with the nostrils open and oc- 
cluded. Ratios were computed between 
the two measures in the manner sug- 
gested by Kantner (7) to indicate effi- 
ciency of velopharyngeal closure. If 
the two measures were equal, a ratio 
of 1.00 was assigned. If the measure 
with the nostrils occluded exceeded 
that with the nostrils open, a ratio of 
less than 1.00 was assigned. 

Another set of pressure scores was 
obtained by using an oral manometer 
designed to me*sure air pressure in 
ounces per square inch. This equipment 
was available for the testing of only 
41 of the 103 subjects. Measures were 
taken with the nostrils open and also 
with the nostrils occluded and ratios 
were computed in the same manner as 
for the vital capacity measurements. 
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Taste 1. Comparison of oral breath pressure ratios of a group of cleft palate subjects who achieved 
velopharyngeal closure and a group who did not achieve velopharyngeal closure as revealed by lateral 
head x rays. 








Closure Group 





No Closure Group 





N Mean Ratio N Mean Ratio t tou 
Wet Spirometer 34 .99 40 .68 5.98 2.36 
Manometer 10 .98 19 .70 3.13 2.47 








X-ray data were obtained for each 
subject phonating the vowel [u] and 
also sustaining the fricative sound [s]. 
Measurements were made of the degree 
of velopharyngeal opening under both 
of these conditions only for those sub- 
jects who had had cleft lips only or 
cleft palates that had been surgically re- 
paired. The measurements were made 
in millimeters and represented the mini- 
mal distance between the palate and the 
posterior pharyngeal wall or lymphoid 
tissue. 


Results 


A gross dichotomy was made be- 
tween the subjects who achieved clos- 
ure while sustaining either [u] or [s] 
and those who did not. The mean 
ratios for the wet spirometer and for 
manometer mecsurements for the two 
groups (closure and no closure) and 
results of the ¢ tests are reported in 
Table 1. In spite of the fact that the 
degree of velopharyngeal opening was 
not taken into account for the group 
that did not achieve closure, the dif- 
ference between the mean ratios for 
the two groups is significant beyond 
the 1% level. 

The absolute oral breath pressure 
data also are of interest. They were 
obtained by use of the oral manometer 
and were available for 10 subjects with 
closure and 19 subjects with no closure. 


The subjects with closure did as well 
with the nostrils open as they did with 
the nostrils occluded, the means being 
15.6 oz and 15.3 oz, respectively. In 
contrast, the no-closure group achieved 
a mean of 7.2 oz with the nostrils open 
and 11 oz with the nostrils occluded. 
The most reasonable explanation for 
the difference between the mean pres- 
sures achieved by the closure and no- 
closure groups with the nostrils oc- 
cluded is that this difference is due 
to sampling and experimental error 
rather than to an inherent difference in 
the blowing ability of the two groups. 
Finally, the two means, each above 
15 oz per square inch, obtained for the 


‘good closure group, place this group 


in the ‘average’ or ‘moderately high’ 
categories of intraoral breath pressure 
levels attained by normal children in 
Goddard’s (4) study. On the other 
hand, the mean of 7.2 oz per square 
inch obtained for the poor closure 
group with the nostrils open falls in the 
‘low’ pressure category established by 
Goddard for normal children. In fact, 
only 6% of Goddard’s 200 normal sub- 
jects obtained scores of 7 oz per square 
inch or less and those that did were, 
with one exception, five-year-olds, her 
youngest age group. These data and 
clinical impressions resulting from the 
testing of a wide variety of subjects 
with clefts not included in the study 
suggest that air pressure of 8 oz per 
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TaBLE 2. Correlation coefficients between various oral-breath-pressure, x-ray, and articulation measures 
obtained from a group of subjects with cleft palates. 











Variables Correlated N r T01 

Wet spirometer ratio and manometer ratio 40 .65 .39 
Wet spirometer ratio and Templin ratio 99 -40 20 
Manometer ratio and Templin ratio 41 .50 .39 
Wet spirometer ratio and frequency of correct production of 

selected plosive sounds 69 53 .33 
Wet spirometer ratio and frequency of correct production of 

selected fricative sounds 69 48 33 
Wet spirometer ratio and velopharyngeal opening while pho- 

nating [ul] 65 -.69 33 
Wet spirometer ratio and velopharyngeal opening while 

producing [s] 64 -.62 .33 
Manometer ratio and velopharyngeal opening while pho- 5 s 

nating [ul] 23 -.58 51 
Manometer ratio and velopharyngeal opening while pro- 

ducing [s] 23 -.69 51 
Templin ratio and velopharyngeal opening while phonat- 

ing [u] 67 -.45 33 
Templin ratio and velopharyngeal opening while produc- 

ing [s] 67 -.51 .33 








square inch is the minimum amount 
necessary for the satisfactory produc- 
tion of consonant sounds requiring oral 
breath pressure. 

As indicated earlier, eight subjects 
had cleft lips with no involvement of 
the palate and seven subjects had un- 
repaired clefts of the palate. These two 
groups probably represented the ex- 
tremes of velopharyngeal competency. 
The mean wet spirometer ratio was 
1.00 for the cleft-lip-only group and 
36 for the unrepaired-palatal-cleft 
group. The mean Templin ratio was 
1.00 for the cleft-lip-only group and 
60 for the unrepaired-palatal-cleft 
group. Again the discrepancy in the 
means for the two groups emphasizes 
the importance of velopharyngeal com- 
petency both in developing oral pres- 
sure and in adequate articulation skills. 

The strength of relationship for a 
number of pairs of variables was esti- 
mated by Pearson’s correlation proce- 


dure. The obtained 7s, reported in 
Table 2, are all significant beyond the 
1% level. The relationship between the 
wet spirometer ratios and the manome- 
ter ratios is fairly substantial (r = .65). 
Inspection of the data, however, reveals 
that for some individuals one of the 
measures was a better predictor of velo- 
pharyngeal closure and articulation abil- 
ity whereas for other individuals the 
other measure was a better predictor. 
For example, a few individuals who 
were tested on the manometer were 
able to impound air in the buccal cavity 
by valving between the tongue and 
palate since the manometer functions 
as a Closed air-system. In these in- 
stances, the pressure measurements ob- 
tained on the manometer were spuri- 
ously high. However, these same indi- 
viduals could not use this valving tech- 
nique when being measured on the wet 
spirometer because it functions as an 
open air-system. As a consequence, dis- 
crepancies between the two pressure 
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ratios were occasionally observed for 
some individuals. 

The 7 for estimation of the relation- 
ship between the wet spirometer ratios 
and the Templin ratios is .40. The rather 
low strength of this relationship is 
not surprising in view of the fact that 
production of many of the sounds 
tested in the Templin test require little, 
if any, oral breath pressure. Further- 
more, factors other than inadequate 
pressure can account for the misarticu- 
lations. 

An r of .50 was obtained by correlat- 
ing the manometer ratios to the Temp- 
lin ratios. Although this coefficient is 
somewhat higher than that reported for 
the wet spirometer ratios and the 
Templin ratios, it is based on fewer 
subjects. Since the reduction in the 
number of subjects extends the confi- 
dence interval, the difference between 
the two coefficients cannot be meaning- 
fully interpreted. The two rs are not 
independent and no test of the differ- 
ence between them is available. 

The wet spirometer ratios were cor- 
related with the frequency of correct 
productions of 36 plosive sounds and 
of 36 fricative sounds elicited in the 
Templin test. To minimize the effects 
os immaturity of articulation develop- 
ment as a basis for the misarticulations, 
only data pertaining to those subjects 
who were eight years of age or older 
were used. Although for this group the 
fricative sounds were generally more 
defective than the plosive sounds, the 
frequency of fricative errors did not 
correlate more highly with the wet 
spirometer ratios than did the number 
of plosive errors. The Subtelny and 
Subtelny data (13) reveal that a very 
small velopharyngeal opening is likely 
to result in defective fricatives, sug- 


gesting that the relationship between 
frequency of fricative misarticulations 
and oral breath pressure is not linear. 
If the relationship is not linear in this 
instance, it follows that the correlation 
coefficient will be reduced. 

The elimination of the younger sub- 
jects reduced the number of manome- 
ter readings to the point where corre- 
lations between the frequency of cor- 
rect productions of the 36 fricative 
sounds and of 36 plosive sounds and 
the manometer ratios were not justified. 

The wet spirometer and manometer 
ratios were correlated with the extent 
of velopharyngeal opening as deter- 
mined from the x-ray film. The nega- 
tive correlations shown in Table 2 in- 
dicate that, in general, the lower the 
pressure ratios, the greater the velo- 
pharyngeal opening. These data do not 
reveal any important advantage in the 
use of x-ray films taken during the pro- 
duction of the vowel [u] or the con- 
tinuant [s] in predicting adequacy of 
oral breath pressure as measured either 
by the wet spirometer or the oral ma- 
nometer. 

Two more correlations were ob- 
tained from the Templin ratios and the 
x-ray measurements. The degree of re- 
lationship between the articulation and 
x-ray measures is roughly comparable 
to that between the articulation and 
pressure measures. The correlation co- 
efficient of —.51 obtained by relating 
the Templin ratio to the minimal velo- 
pharyngeal distance during the produc- 
tion of [s] is of a somewhat lower 
order than the coefficient of .78 ob- 
tained by Hagerty and Hoffmeister (5) 
when they studied the relationship be- 
tween the minimal velopharyngeal dis- 
tance during the production of [s] and 
an articulation score. The difference 
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may be due to the relative number of 
pressure and nonpressure sounds con- 
tained in the two articulation tests al- 
though Hagerty and Hoffmeister do 
not describe their articulation test in 
detail. The correlation coefficient of 
—.45 obtained from the Templin ratio 
and the minimal velopharyngeal dis- 
tance during the phonation of [u] may 
be compared to Counihan’s (3) results. 
He also obtained x-ray measurements 
of velopharyngeal closure while his 
subjects phonated the vowel [u]. These 
measures were correlated with articu- 
lation scores and scale values obtained 
from ratings of ‘understandability.’ 
The obtained coefficients were .33 and 
41, respectively. Wildman reports 
somewhat higher correlations between 
various measures of velopharyngeal 
adequacy obtained from x rays during 
the production of [u] and the amount 
of nasal emission during the production 
of selected plosive and fricative non- 
sense syllables. 


The role of oral breath pressure in 
the articulation of speech sounds can 
be studied by examining the results of 
the pressure measurements and articula- 
tion tests of those subjects whose 
scores on these tests were normal or 
nearly so. Data are from 54 subjects 


TaBLE 3. Comparison of wet-spirometer and 
Templin ratios of subjects (N=14) with clefts of 
of the palate only and subjects (N=52) with 
clefts of the lip and palate. 








Lip and Palate Only 
Palate 


Mean Ratio Mean Ratio t 





Wet Spirometer .84 .64 2.32* 
Templin Test .80 .65 2.82t 








*Significant at 5% level. 
{Significant at 1% level. 


with wet spirometer ratios of .95 or 
better and 38 subjects with Templin 
ratios of .90 or better. The mean wet 
spirometer ratio of the 38 subjects with 
Templin ratios of .90 or better was .95 
and of the total group of 103 was .79. 
On the other hand, the group of 54 
with wet spirometer ratios of .95 or 
better obtained a mean Templin ratio 
of .88 while the total group of 103 ob- 
tained a mean Templin ratio of .75. 
High Templin ratios are thus more 
likely to be accompanied quite con- 
sistently by high wet spirometer ratios 
than the reverse. These data seem to 
substantiate the clinical impression that 
good articulation is almost always as- 
sociated with a high pressure ratio 
whereas a high pressure ratio is not 
always associated with good articula- 
tion. 

For analysis of the articulation and 
pressure differences between the group 
of subjects who had a cleft of the pal- 
ate only and those who had a cleft of 
both the lip and palate, only the sub- 
jects who had had surgical manage- 
ment of the palatal cleft were used. 
Comparisons were made between the 
wet spirometer ratios and the Templin 
ratios of the two groups. Table 3 re- 
veals that the group with cleft palate 
only had significantly lower means 
than the group with cleft lip and palate. 
This finding is in agreement with that 
of Bzoch (2), who found that pre- 
school children with clefts of the palate 
only had significantly more articulation 
errors than those with clefts of the lip 
and palate. Clinical reports have indi- 
cated that the nature and size of the 
cleft and the amount of tissue avail- 
able for the group with cleft palates 
only are such that the results of physi- 
cal management are frequently poorer 








324 Journal of Speech and Hearing Research 


for this group than for the group with 
cleft lips and palates. It appears that 
there may be physical and physiolog- 
ical bases for the differences observed 
between the subjects with different 
types of clefts. 


Discussion 


The correlations between the vari- 
ous measures used in this study are rela- 
tively low. The absence of higher rela- 
tionships is not surprising in view of 
individual variability and the complex- 
ity of the speech process. It should be 
remembered that each of the measures 
was obtained only once for each sub- 
ject. No research has been done to de- 
termine how stable the x-ray and pres- 
sure measures are on repeated tests of 
individuals with cleft palates. Further- 
more, it is obvious that the adequacy 
of articulation development depends 
on more than the preciseness of velo- 
pharyngeal closure and the amount of 
oral breath pressure that the speaker 
can develop. Other possible factors, for 
example, are faulty learning and dental 
deviations. In fact, it is somewhat sur- 
prising to find that the relationships 
under study here are as strong as they 
are. Yet almost without exception sub- 
jects who were unable to develop ade- 
quate breath pressure showed marked 
deficiencies in articulation. 


The cleft-lip-only group had normal 
air pressure and articulation skills com- 
parable to those of physically normal 
children of the same age and sex. These 
findings suggest that the limitations 
imposed by the labial and dental devi- 
ations which typically accompany a 
cleft lip do not materially affect articu- 
lation proficiency, at least when they 
do not exist in conjunction with those 


structural deviations created by a cleft 
palate. On the other hand, the subjects 
in the unrepaired-palatal-cleft group 
did have appreciable articulation profi- 
ciency even though they had a marked 
lack of structural continuity and also 
major oral pressure deficiencies. The 
results provide support for the often- 
mentioned observation that speakers 
are frequently able to compensate for 
oral structural deficiencies and that the 
adequacy of articulation skills cannot 
be predicted solely on the basis of the 
adequacy of the articulatory mecha- 
nism. 

It would appear that prediction of 
articulation performance would be 
equally successful from either x-ray 
data or pressure measurements. How- 
ever, the results do not suggest that 
one set of measures should be used to 
the exclusion of the other since the 
highest relationship between the x-ray 
data and pressure measurements is only 
.69. For some individuals x-ray data are 
more indicative of velopharyngeal com- 
petency whereas for others oral-breath- 
pressure measurements are better indi- 
cators. 

The purpose for the employment of 
pressure measures is not to predict ac- 
curately what the articulation of a 
given subject will be but rather to 
assess better the bases for the articula- 
tion problems observed. Individual 
variability argues for many different 
kinds of observations. Pressure measure- 
ments in conjunction with x-ray and 
articulation data should be useful in 
identifying the etiological factors in- 
volved in speech problems and thus 
make possible more intelligent planning 
for future physical and therapeutic 
management. 
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Summary 


Lateral head x rays, intraoral breath 
pressure measurements, and Templin 
articulation scores were obtained on 
103 children with cleft lips and _pal- 
ates. 

The main findings were as follows: 
(a) Measures of intraoral breath pres- 
sure were related to velopharyngeal 
competency and articulation scores. 
(b) All of the subjects with cleft lips 
only had, in general, normal oral pres- 
sure and articulation scores. (c) Sub- 
jects with good articulation scores al- 
most invariably had high pressure 
scores; subjects with high pressure 
scores occasionally had relatively poor 
articulation scores. (d) Individuals with 
cleft lips and palates, in general, 
achieve better articulation scores and 
higher breath pressure than do individ- 
uals with cleft palates only. (e) Meas- 
urement of oral breath pressure of 
speakers with cleft lips and palates 
can be of major diagnostic value in 
establishing the bases for the articu- 
lation problems. 
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Calling Attention to Stuttering 


M. E. WINGATE 


The phenomenon of stuttering has re- 
ceived concentrated attention in the 
past three decades and during this time 
many hypotheses have been advanced 
to explain its nature. Prominent among 
those which have developed in this 
period has been the ‘semantogenic’ or 
‘evaluational’ theory of Wendell John- 
son (13, 15, 16), which asserts that stut- 
tering develops as a result of critical 
evaluation and disapproval of ‘normal 
nonfluency’ in the early stages of 
speech development and persists be- 
cause the individual strives to avoid the 
nonfluency which was disapproved. 
This position thus makes disapproval a 
central factor in the etiology and per- 
sistence of stuttering. It stresses that 
one should not be critical of nonfluen- 
cies since disapproval has the effect of 
increasing the speaker’s effort to avoid 
nonfluency, an effort which is reflected 
in exaggerated pausing, repetition, and 
strain. This admonition has also been 
stated inversely: it is bad practice to 
praise a stutterer for fluent speech since 
the net effect is to strengthen his con- 
viction that he should never speak non- 
fluently (17, p. 458). 





M. E. Wingate (Ph.D., University of 
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ticle is based on a doctoral dissertation done 
at the University of Washington under the 
direction of Dr. Charles R. Strother and is an 
adaptation of a paper presented at the 1957 
Convention of the American Speech and 
Hearing Association, Cincinnati. 


Volume 2, No. 4 326 


Wischner (27), in his description of 
stuttering as a learned avoidant re- 
sponse, has based his thinking on John- 
son’s position. In answering the ques- 
tion of what the stutterer is attempting 
to avoid, Wischner quotes Johnson’s 
statement that stuttering is an avoid- 
ance reaction ‘designed to avoid the 
nonfluency which the individual has 
learned to fear and dread and expect’ 
and ‘an attempt to avoid the nonfluency 
that was originally disapproved.’ 
Wischner agrees that the stutterer’s 
anxiety is based on this original dis- 
approval but adds: 

Within the present learning theory frame 

of reference, however, it would appear to 

be more accurate to say that the adult 
. Stutterer is avoiding, not nonfluency as 
such, but the original consequences (nox- 
ious stimulation) which attended the orig- 

inal nonfluent behavior (p. 329). 

By ‘the original consequences’ Wisch- 
ner means ‘painful reactions’ within the 
individual elicited by the reactions of 
those judging his speech. 

The notion that being critical of a 
stutterer’s speech only serves to make 
it worse has received widespread ac- 
ceptance although support for it has 
been largely on a common-sense basis 
with only implicit or indirect support 
from a few experimental studies. Cer- 
tain studies have reported relationships 
between the frequency of stuttering 
and (a) the stutterer’s own focus on 
his speech (14, 18, 19, 20, 23); (b) the 
effect of various distractions (1, 2, 19); 
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or (c) varying degrees of situational 
stress (8, 10, 23, 25). Studies employing 
experimental conditions in which ex- 
plicit attention is called to the stutter- 
ing in some way have been extremely 
few in number. Studies by Maddox 
(22) and Van Riper (24) were oriented 
in this direction: Maddox reported 
stuttering to increase when the subjects 
observed themselves in a mirror while 
speaking; Van Riper reported an in- 
crease in stuttering when the subjects 
were threatened with an electric shock 
for each instance of stuttering. These 
findings, especially those of the latter 
study, seem to have had the effect of 
making further work in this area appear 
unnecessary. More recently Sheehan 
(24), using the writings of Wischner as 
his point of departure, reported a study 


in which he had stutterers repeat each’ 


stuttered word until the word was said 
fluently. This procedure resulted in a 
decrease in stuttering. Sheehan views 
stuttering as an approach-avoidance 
conflict and interpreted these results as 
indicating an ‘unlearning’ of the ‘avoid- 
ant response.’ One could as well main- 
tain that the obtained decrease in stut- 
tering was occasioned by the permissive 
atmosphere of the experimental condi- 
tion which effected a decrease in what 
is generally referred to in the literature 
as ‘situational anxiety,’ or that a ‘set’ 
to speak more fluently was communi- 
cated to the subjects. 

In view of the theory that stuttering 
is aggravated if a speaker’s speech diffi- 
culties are called stuttering and are 
labeled as undesirable, and because con- 
clusive supporting evidence is lacking 
in the literature, further investigation 
would appear indicated to discover 
what happens to a stutterer’s speech 
when his ‘blocks’ or moments of diffi- 


culty are evaluated directly and criti- 
cally. 


Problem 


The present study was designed to 
test the hypothesis that amount of stut- 
tering will decrease under conditions 
in which emphasis is placed on speak- 
ing without stuttering and then calling 
attention to stuttering whenever it 
occurs. 


Method and Procedure 


The subjects were 18 male stutterers 
ranging in age from 15 to 39, with a 
mean age of 22. All of the subjects had 
stuttered since childhood and all had 
had some amount of speech correction. 
Using Shulman’s (25) criteria, subjects 
were eliminated who did not stutter on 
at least 2% of the words in the first 
passage spoken. 

In this study experimental conditions 
were devised so that the subjects com- 
municated verbally to another person, 
a procedure being considered a closer 
approximation of the real-life situation 
than would be the case in simply read- 
ing a paragraph, the technique used in 
many studies of stuttering. In this more 
‘communicative’ type of situation, the 
subject gave directions to an assistant 
to do various simple tasks. 


The material used consisted of three 
different passages, ranging in length 
from 251 to 257 words, made up of 
sentences averaging nine words in 
length, stating directions to do simple 
tasks. The three passages were pre- 
sumably of equal difficulty; the experi- 
mental design, however, insured that 
any effects due to possible differences 
among them were eliminated. 
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Taste 1. Reliability of judgments of words stuttered. 
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Correlation Between Judgments 





Experimenter (Original) 
Experimenter (Playback) 


Independent Observer 
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Each subject performed under three 
different conditions. Condition I was a 
neutral condition designed to provide 
a measure of the stutterer’s customary 
speech behavior against which to com- 
pare his performance in the experi- 
mental conditions. Therefore, Condi- 
tion I was always given first and each 
subject served as his own control. Con- 
ditions II and III were counterbalanced 
in order of presentation after Condition 
I. At least 24 hours intervened between 
any two of the three conditions; for 
most subjects the interval was 48 hours. 
Previous studies (8, 21, 27) have shown 
that the effect of adaptation’ dissipates 
rather quickly after an experimental 
session, 24 hours being more than ade- 
quate to eliminate this effect. Condition 
II required that the subject’s communi- 
cation be interrupted whenever he 
began to stutter and to accomplish this 
he had to speak into a microphone con- 
nected to earphones worn by the as- 
sistant. To equalize the effect of the 
apparatus itself, the microphone and 
earphones were used in all three condi- 
tions. 

Each subject spoke the directions in 





“In stuttering literature ‘adaptation’ refers 
to the progressive decrease in stuttering oc- 
curring with repeated reading of the same 
material. 


each of the three passages to the assis- 
tant, who carried out each direction 
when the subject had finished speaking 
it. The subject spoke a different set of 
directions in each of the three condi- 
tions. In all three conditions the subject 
spoke the directions eight times in suc- 
cession, in order to provide a measure 
of the adaptation effect. To eliminate 
any effects that might have resulted 
from differences between the passages, 
they were counterbalanced among the 
three conditions and each passage fol- 
lowed each of the other passages an 
equal number of times. 

For all conditions the experimenter 
was stationed in an adjacent room be- 
hind a one-way mirror from where he 
could control the apparatus for the 
experimental conditions. Before assum- 
ing his station the experimenter in- 
structed the subject in what he was to 
do in the current condition. 

In Condition I he was to speak in his 
usual manner, 

In Condition II he was to complete 
the passage quickly but would not be 
permitted to continue his communica- 
tion until he had said a stuttered word- 
fluently. Whenever the subject began 
to stutter, the connection between the 
subject and the assistant was broken 
and a small light and buzzer signalled 
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Ficure 1. Frequency of words stuttered 
through successive readings in one neutral 
and two experimental conditions: Condition 
I, neutral, with no interruptions and no re- 
minders; Condition II, experimental, with in- 
terruptions; Condition III, experimental, with 
reminders. 


the subject that his communication was 
interrupted. 

In Condition III he was told to speak 
as fluently as possible and that he would 
be reminded of each instance of stutter- 
ing, but could continue on. He was 
‘reminded’ by the recording of each 
error on an _ electrically activated 
counter placed in front of him. 

The experimenter followed the sub- 
ject’s course through each condition on 


copies of the particular passage in use, 
drawing a line through every word 
stuttered. A separate copy of the 
passage was used for each of the eight 
readings in any one condition. In all 
three conditions count was not made 
of those instances in which the subject 
checked himself when about to stutter 
but did not actually do so. 


Reliability 


In order to check the experimenter’s 
reliability in the judgment of words 
stuttered, a sample of each subject’s 
performance in all conditions was re- 
corded. These recordings were later 
played back, at which time the experi- 
menter and another speech pathologist 
made independent judgments of the 
stuttering, marking them in the same 
manner as described above. The judg- 
ments from all recordings were pooled 
and three phi coefficients were com- 
puted to measure the correlations be- 
tween the three sets of judgments 
(Table 1). The phi coefficient was con- 
sidered the most appropriate statistic 
since it reflects the extent of agreement 
as to which words were stuttered rather 
than simple concurrence on the number 
of words stuttered. The correlations, 


TaBLE 2. Summary of ¢ test (df=17) and results for evaluation of differences in amount of stuttering 
between Condition I (neutral) and Condition II (interruption). 











Reading My My Du oD t P 
1 18.72 7.78 10.94 2.64 4.14 .01 
2 15.61 5.67 9.94 3.22 3.09 .O1 
3 16.72 4,94 11.78 3.92 3.01 01 
4 17.01 3.78 13.23 4,23 3.06 01 
5 14.83 3.61 11.22 4.09 2.74 .02 
6 14.50 3.72 10.78 3.58 3.01 .O1 
6 15.17 3.28 11.89 4.31 2.76 .02 
8 14.56 3.28 11.28 3.96 2.85 .02 
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TaBLE 3. Summary of ¢ test (df=17) and results for evaluation of differences in amount of st7.tering 
between Condition I (neutral) and Condition III (reminding). 











Reading My My Dy oD t deg 
1 18.72 td 11.55 3.21 3.60 01 
2 15.61 5.56 10.05 3.43 2.93 01 
3 16.72 5.11 11.61 4.35 2.67 .02 
4 17.01 4.83 12.18 4.43 2.75 .02 
5 14.83 3.89 10.94 4.32 2.53 05 
6 14.50 4.67 9.83 4.03 2.44 05 
7 15.17 4.33 10.84 4.37 2.48 -05 
8 14.56 4.78 9.78 3.99 2.45 -05 








ranging from .83 to .87, indicate very 
satisfactory reliability of judgment and 
are consistent with figures reported in 
previous studies (2/, 24, 25). 


Results 


As indicated by the curves in Figure 
1, there was a decreasing frequency of 
stuttering through the eight readings 
in all conditions. The curve for Condi- 
tion I starts at a mean value of 18.72 








stutterings in reading J and drops ir- 
regularly to a mean of 14.56 in reading 
8, a difference significant at the .10 
level. This decrease is presumably due 
to the adaptation effect. The curves for 
Conditions II and III are quite similar. 
They start at means of 7.78 and 7.17, 
respectively, and drop in a fairly regu- 
lar fashion to means of 3.28 and 4.78 in 
the last reading. These differences from 
first to last readings are significant at 
the .01 and .02 levels, respectively. 





Ficure 2. Frequency of stuttering of individual subjects. Lines connecting circles indicate Con- 
dition I, neutral, with no interruptions and no reminders; lines connecting dots indicate Con- 


dition II, experimental, with interruptions; dashed lines indicate Condition III, experimental, 
with reminders. 
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The effect of the two experimental 
conditions is clear; both differ signifi- 
cantly from the neutral condition in all 
of the eight readings (Tables 2 and 3). 
The differences between Condition I 
(neutral) and Condition II (interrup- 
tion) are significant beyond the .02 
level; the differences between Condi- 
tion I and Condition III (reminding) 
are significant beyond the .05 level. 
Thus the null hypothesis tested can be 
rejected at these respective confidence 
levels. The differences between each of 
the experimental conditions and the 
neutral condition are evident from the 
first reading on, indicating that what- 
ever factor operated to produce these 
differences was effective from the very 
beginning. 

Analysis of the behavior of individual 
subjects yields some interesting results. 
Graphs plotted for each subject (Figure 
2) reveal that all except four can be 
said to have shown some amount of 
improvement under one or both of the 
experimental conditions. These 14 di- 
vide themselves into two groups: eight 
who showed considerable or marked 
improvement, and six whose improve- 
ment was slight. In some measure the 
proportion of subjects showing only 
slight improvement is inflated due to 
the fact that roughly half of these sub- 
jects stuttered relatively little in the 
neutral condition, thus could show 
little improvement. 

Inspection of the individual graphs 
reveals that, by and large, the curves 
for the two experimental conditions 
show less fluctuation and are flatter 
than those for the neutral condition. 
With very few exceptions these curves 
do not look like learning curves, that 
is, they do not show a pattern of decre- 
ment from the first to last reading 


which would depict learning. Analysis 
of the data for these curves lends sup- 
port to this impression in that there is 
also no evidence of progressive im- 
provement within any one reading; if 
each reading is divided into five 50- 
word segments and the number of stut- 
terings in each of the five segments is 
tallied, these totals show little differ- 
ence from one segment to another. This 
is true for the readings in all three 
conditions. 


Discussion 


It is of considerable interest that the 
differences between the control and 
experimental conditions in this study 
are significant from the first reading. A 
very likely interpretation of this find- 
ing is that the subjects assumed a set 
to speak without stuttering (that is, a 
set to avoid speaking nonfluently), 
were able to maintain this set through- 
out the eight readings, and were able to 
improve their performance even though 
attention was called to their stuttering. 
The similarity of results for the two 
experimental conditions indicates that 
it made little difference whether atten- 
tion was called to the block when it 
first appeared or after it was completed. 

With the interpretation that the de- 
crease in stuttering obtained in this 
study reflects the operation of a set to 
speak fluently, serious question is raised 
as to whether the stutterer’s character- 
istic guiding motivation is simply to 
‘avoid nonfluency.’ The results ob- 
tained suggest that if this were the stut- 
terer’s primary motivation he should be 
able to speak with less stuttering in 
ordinary situations also, On the con- 
trary, then, it would appear that char- 
acteristically the stutterer is not so 
motivated and, in fact, is for some 








332 Journal of Speech and Hearing Research 


reason unable to assume such a set. It 
is reasonable to believe that stutterers 
may avoid some social contacts, or 
avoid speaking in certain situations, or 
even avoid certain words;? and one 
should heed the stutterer’s claim that he 
wishes to speak without stuttering. But 
to contend that he stutters in an effort 
to avoid nonfluency seems unwar- 
ranted, especially in view of the find- 
ings of the present study. Further, to 
explain the variation in appearance of 
stuttering in the individual as due to 
changes in ‘fluency anxiety’ begs the 
question. Undoubtedly anxiety is pres- 
ent, but it is questionable that this is 
simply fluency anxiety; it seems more 
plausible that the stutterer is not so 
much anxious about his speech as he is 
more or less anxious while speaking. 
The sources of this anxiety remain un- 
identified, and most likely vary with 
the individual. 

It is felt that the findings of this 
study have important implications re- 
garding the notion that stuttering re- 
flects avoidance behavior, specifically 
avoidance of stuttering or avoidance of 
nonfluency. Emphasis on stuttering as 
avoidance behavior, as well as focus on 
the symptom per se, has drawn atten- 





*No little attention has been accorded the 
apparent fact that the stutterer finds certain 
sounds and sound combinations difficult and 
thus tends to avoid them. Yet a large number 
of studies investigating the effect of formal 
elements of language structure on stuttering 
(3, 4, 5, 6, 7, 9, 10, 11, 12) have demonstrated 
that the mechanical features of speech are not 
directly related to stuttering. However, one 
of these studies, by Eisenson and Horowitz 
(9), included an evaluation of the role of 
meaningfulness of the speech content and re- 
ported that this factor was directly related 
to stuttering. These authors made the signifi- 
cant comment that meaningfulness of speech 
content is a highly complex factor which in- 
terrelates with others of a personal nature. 


tion from the obvious fact that speak- 
ing is first and foremost man’s basic 
means of communication. Speaking is 
not only activation of the speech 
mechanism resulting in certain sound 
patterns but is more importantly a 
process that involves (a) the conveying 
of a message and (b) a relationship of 
some kind between speaker and auditor. 
It would appear profitable to turn at- 
tention to these factors and give con- 
sideration to the place of stuttering in 
the total process of communicating and 
relating to another person. 

This theoretical position suggests it- 
self: the stutterer’s motivation is to 
maintain communication, and this moti- 
vation is based on and expresses a deep 
need for interpersonal relations. Stut- 
tering is learned and persists because, 
even though a block may be momen- 
tarily unpleasant when it occurs, it is 
nevertheless reinforced by virtue of the 
fact that it ultimately contributes to 
maintaining communication (and may 
have secondary gain value as well). 
‘That is, even when a stutterer blocks 
and his speech pattern is disturbed, he 
is still in a communicative relationship 
with his auditor. Assuming that the 
interpersonal relationship is important 
to the speaker, the block and the acts 
incident to overcoming it are as effec- 
tive in maintaining this relationship as 
are the speech elements that precede 
and follow it. Perhaps the block is 
more effective to this end by virtue of 
possible attendant secondary gains 
(sympathy, attention). 

Implications for a Learning-Theory 
Interpretation of Stuttering. In several 
studies note has been made of the simi- 
larity between the phenomena observed 
in studies of stuttering and the results 
obtained in laboratory studies of learn- 
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ing. For example, the adaptation effect 
is likened to experimental extinction 
and the increase in stuttering following 
a time interval after adaptation is 
likened to spontaneous recovery. The 
results of the present experiment sug- 
gest that such changes may simply re- 
flect fluctuations in the amount of 
anxiety present rather than the occur- 
rence of learning. 

Wischner (27, 28, 29) draws the par- 
allel that stuttering resembles the in- 
strumental avoidance type of learning. 
In this learning paradigm, a response 
is elicited to a conditioned stimulus or 
cue which has been associated with 
punishment. By making the appropriate 
response to the cue, the organism avoids 
the punishment. For Wischner, the cue 
is a word or a speech situation, the 
punishment is the painful reactions to 
disapproval, and stuttering is the avoi- 
dant response. However, this formula- 
tion of stuttering does not adhere 
closely to the learning theory model: 
the stuttering (avoidant response) con- 
tinues to be disapproved (punished), 
that is, the stuttering does not avoid the 
punishment, yet stuttering persists. 
There are further difficulties in Wisch- 
ner’s position: although it emphasizes 
the unpleasantness which the stutterer 
anticipates, it does not account for the 
aspect of unpleasantness reported to 
occur during stuttering. Also according 
to his formulation, punishment is a nec- 
essary element in causing stuttering, a 
point criticized earlier. 

A later publication by Wischner 
(30) lends support to an alternative 
hypothesis mentioned by him earlier 
(27), namely, the possibility that stut- 
tering behavior is learned and persists 
because of the close association with 
tension-reduction when the word is 


finally completed. Such ‘an hypothesis 
could well incorporate the factor of 
the discomfort during the stuttering 
act. It is possible that this neglected 
aspect of the stuttering sequence may 
offer the key to an adequate account 
of the mechanics of stuttering behavior. 

If one wishes to conceive of stutter- 
ing as a form of learned behavior ac- 
quired through the agency of some 
painful stimulus source (fear, anxiety) 
it would seem that the instrumental- 
escape learning paradigm offers a better 
model. In this type of learning situation 
the animal learns to escape noxious 
stimulation by performing an act that 
stops the stimulation. The principle 
here is that by making a certain re- 
sponse the animal escapes from an 
aroused state of tension, and the re- 
sponse is reinforced because it results 
in tension reduction. Applied to stut- 
tering, this model would offer the fol- 
lowing account: in the course of speak- 
ing the stutterer reaches certain points 
wherein the level of tension in his 
attempt to communicate is raised. The 
speech musculature shares in this in- 
creased tension (probably in greater 
proportion since it is the muscle group 
in focus) and a block occurs. In the 
effort to continue communication, the 
individual strives to overcome the 
block, which effort is expressed as stut- 
tering. Completion of the stuttered 
word brings tension reduction. The 
stuttering act is thereby instrumental 
in escape from aroused tension and is 
thus reinforced. 

With this analysis in mind one might 
look again at the early period of speech 
development when stuttering is noted 
to arise most frequently. In acquiring 
the articulatory coordinations necessary 
for speech, the young child stumbles 
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and fumbles on many speech sounds as 
he attempts to express himself, and 
periods of fluency alternate with pe- 
riods of nonfluency. If no additional 
stress is introduced into the perform- 
ance of this activity then both types of 
behavior would be equally reinforced. 
Presumably due to increasing matura- 
tion, which is attended by greater 
facility in fine coordinative movements, 
articulation becomes increasingly pre- 
cise and fluent speech thus receives 
more and more reinforcement. How- 
ever, if additional stress is introduced 
into the speech situation, the child’s 
level of tension in communicating is 
raised. Such additional stress may be 
created by any untoward emotional 
relationship between the child and his 
auditor(s), of which an adult’s dis- 
approving attitude toward the child’s 
speech is but a part. Whatever the na- 
ture of the child’s communicative be- 
havior under these conditions, his 
speech, whether fluent or stuttered, re- 
ceives more reinforcement because it 
is attended by a greater amount of ten- 
sion reduction. : 

A conception of stuttering as instru- 
mental-escape learning offers these“ ad- 
vantages: (a) it is not embarrassed in 
its account of the role of disapproval 
in stuttering, since it need not formu- 
late disapproval of speech pattern as 
a necessary nor the critical factor in 
stuttering, (b) it provides a theoretical 
position consistent with widely em- 
ployed treatment procedures which aim 
to help the individual modify his stut- 
tering responses and control his fears, 
and (c) it allows for a variety of 
dynamics underlying the tension sur- 
rounding the speech act. It also allows 
consideration of the still very real 
possibility that some stutterers may be 


inferior in terms of neuromotor func- 
tion; that is, in some individuals more 
than others, fine coordinative acts 
(such as those involved in speech) 
might be more difficult to learn, might 
be more susceptible to the pressures of 
situational stresses, and blocks thus 
more readily occur. 


Summary 

After appraising the theory that stut- 
tering is an attempt to avoid nonfluency 
and that conveying a critical attitude 
toward stuttering makes it worse, this 
study tested the hypothesis that stut- 
terers would actually stutter less under 
conditions in which they were made 
aware of their stuttering. Recordings 
were made of the performance of 18 
stutterers, speaking 250-word passages 
eight times in succession under three 
conditions, one in which they were 
permitted to speak in their usual man- 
ner and two in which attention was 
called to their stuttering. The results 
showed significantly less stuttering in 
the two experimental conditions. Anal- 


* ysis of the data suggests the interpreta- 


tion that these results were due to 
subjects’ assuming a set not to stutter, 
that is, to avoid speaking nonfluently, 
there was no evidence that learning 
occurred. The suggestion is made that 
consideration of the stutterer’s motiva- 
tion should take account of the econ- 
omy of stuttering in the total process 
of communication. Note is made of 
implications in this study regarding 
description of stuttering as instrumen- 
tal-avoidance learning. It is suggested 
that the instrumental-escape learning 
paradigm appears to provide a more 
suitable model than the instrumental- 
avoidance paradigm for analyzing stut- 
tering in terms of modern learning 
theory. 
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Post-exposure Residual Effects 


of Low-Level Noise 


WILLIAM J. TRITTIPOE 


In a previous study (2, 3) the residual 
effects of low-level noise on the tem- 
porary threshold shift (TTS) were 
demonstrated. It was shown that noise 
of a level insufficient to produce a sig- 
nificant TTS will significantly modify 
the TTS following exposure to a more 
intense sound. Specifically, when a 15- 
minute exposure to white noise at 78 db 
SPL (which alone produced no TTS) 
preceded a three-minute exposure to 
white noise at 118 db SPL, the TTS 
following the second exposure was 
greater than when the same exposure 
was preceded by a control condition of 
15 minutes of silence. This might be 
termed a ‘pre-exposure residual effect.’ 
The present study is concerned with 
the question of whether a post-exposure 
residual effect can be similarly demon- 
strated. That is, if a low-level -noise 
which in itself is insufficient to produce 
a measurable threshold shift follows a 
high-level noise which does produce 
a measurable threshold shift, will the 
threshold shift be greater than when 
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the high-level noise is followed by a 
period of silence of a duration equiva- 
lent to that of the low-level noise? 


Procedure 


Terminology. Presentation of sound 
at a level known to produce a TTS 
will be referred to as a ‘principal ex- 
posure.’ Presentation of sound at a level 
at which that sound alone produces no 
apparent TTS will be referred to as a 
‘sensitizing exposure.’ When a sensitiz- 
ing exposure precedes a principal ex- 
posure the former will be termed the 
‘sensitizing pre-exposure.’ When the 
sensitizing exposure follows a principal 
exposure, the sensitizing exposure will 
be referred to as the ‘sensitizing post- 
exposure.’ If a period of silence pre- 
cedes or follows the principal exposure, 
that period of silence will be termed, 
respectively, the ‘silent pre-exposure’ or 
the ‘silent post-exposure.’ 


Method. The TTS was measured fol- 
lowing two conditions of exposure to 
a principal exposure noise level: (a) a 
control condition (upper section Fig- 
ure 1) in which a principal exposure 
was followed by a silent post-exposure; 
(b) an experimental condition (lower 
section, Figure 1) in which a principal 
exposure was followed by a sensitizing 
post-exposure. The noise level of the 
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Figure 1. Schematic diagram of control and 
experimental conditions. The top section rep- 
resents the control condition; the bottom sec- 
tion represents the experimental condition. 
The two conditions are identical, except that 
for the control a period of silence followed 
the 120-db noise exposure, and for the experi- 
mental condition a noise of low level fol- 
lowed the 120-db exposure 


principal exposure was 120 db SPL. 
The duration of the entire test was 
23 minutes. The first seven minutes 
were devoted to threshold measure- 
ments at 4000 and 6000 cps, alternately. 
This was followed by a three-minute 
principal exposure. Immediately after 
the principal exposure, there was a 
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Ficure 2. Measurement schedule for the test 
frequencies employed. Time is represented 
on the abscissa, and a relative db scale is 
represented on the ordinate. The test fre- 
quencies were 4000 and 6000 cps. The trac- 
ings on the upper half of the figure show 
the measurements made prior to sound ex- 
posure. The tracings on the lower half show 
the measurements made after sound exposure. 
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three-minute silent post-exposure for 
the control condition, a three-minute 
sensitizing post-exposure for the experi- 
mental coli The post-exposure 
noise levels were 70 and 80 db SPL. 
Following the sensitizing post-exposure, 
and following the silent post-exposure, 
recovery threshold measurements were 
made at 4000 and 6000 cps for 10 
minutes. 

Figure 2 illustrates the threshold 
measurement schedule employed in the 
tests. Time is represented on the ab- 
scissa, and a relative decibel scale is 
represented on the ordinate. The trac- 
ings on the upper half of this figure 
represent the threshold measurements 
made prior to exposure to the principal 
exposure. The tracings on the lower 
half of the figure represent the sequence 
of measurements after the post-exposure 
condition. The difference in decibels 
between the measurements made before 
the principal exposure, and after the 
sensitizing post-exposure condition, de- 
fines the TTS. Only the latter of the 
two sets of measurements at 4000 cps, 
obtained prior to the principal expo- 
sure, was employed in the computations. 


Subjects. Seven young adults with 
normal hearing served as listeners. Each 
listener contributed both ears to the 
tests. Each ear of each listener was ex- 
posed twice to the control condition, 
and twice to each experimental condi- 
tion. (Each ear of two additional lis- 
teners was exposed to one run under 


*The noise intensities are in decibels re 
0.0002 dynes/cm*. These sound pressure levels 
were measured by a standard 6-cc coupler 
and a 640 AA, Western Electric condenser- 
microphone system. The noise had a uniform 
spectrum level, and the noise levels reported 
represent the over-all output of the earphones 
from the coupler-microphone system. 
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each condition.) In addition, each ear 
of the nine listeners was exposed once 
to the higher sensitizing noise level 
(80 db) for six minutes, This was nec- 
essary to establish that the sensitizing 
noise levels by themselves produced no 
apparent TTS. These tests demon- 
strated 0 db of TTS after about 30 sec- 
onds following noise exposure. It was 
assumed that a 70-db exposure for six 
minutes also would have demonstrated 
0 db of TTS. 

The order of presentation of the ex- 
perimental and control conditions was 
randomized, and at least 24 hours 
elapsed between successive exposures. 
If the listener’s threshold had not recov- 
ered to within 5 db of his pre-exposure 
threshold, testing was delayed for at 
least another 24 hours. 


Instrumentation. A Békésy-type, con- 
tinuous-recording audiometer was used 
to obtain all threshold measurements. 
A single PDR-10 earphone, encased in 
a MX41/AR cushion and mounted in a 
binaural headset, was employed in all 
the tests. The earphones were not re- 
moved during any test including the 
period of silence for the control condi- 
tion. The threshold test frequency was 
interrupted by an electronic switch. It 
was ‘on’ for approximately 0.4 sec and 
‘off’ for 0.4 sec. The rise-fall time was 
50 msec. All threshold measurements 
were made in an I.A.C. audiometric 
testing room. The ambient noise level 
in the testing room was 35 db SPL, 
with most of the energy concentrated 
below 70 cps. The noise exposures were 
of uniform spectrum with the band 
width limited only by the headphone’s 
characteristics. For the principal expo- 
sure the noise was turned on slowly, 
that is, it took about seven seconds for 


the level to go from 0 db to 120 db 
SPL. 


Results and Discussion 


The results averaged over the group 
are presented in Figure 3. The TTS is 
represented on the ordinate; the post- 
exposure measurement time is repre- 
sented on the abscissa. Each point rep- 
resents the average of 28 runs, An in- 
spection of the curves in Figure 3 shows 
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Ficure 3. Average TTS (N=7) at 4000 cps 
as a function of the post-exposure measure- 
ment time. The thick, solid curve represents 
the control condition. The dotted curve and 
the thin, solid curve represent the 70- and 


-80-db post-exposure conditions, respectively. 


that for the data averaged over the 
group, both the 70- and 80-db post- 
exposure conditions produced a greater 
TTS than the control condition. This 
differential between conditions is appar- 
ent even 10 minutes after sound cessa- 
tion. Differences between the experi- 
mental and the control conditions were 
examined by ¢ tests for correlated 
measures. Significant differences were 
observed between the 80-db experimen- 
tal condition and the control condition 
(p<0.02). However, the 70-db experi- 
mental condition and the control con- 


dition were not significantly different. ' 


Measurements made at 6000 cps for 
residual effects showed no significant 
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differences between the experimental 
and control conditions. In this analysis, 
each listener contributed one score 
under each condition (the TTS aver- 
aged over both ears for each listener 
for measurements made at 2-, 3-, and 
6-min post-exposure time). The two 
listeners who failed to complete the 
tests were not included in this analysis. 

All but one listener showed post- 
exposure residual effects. In only one 
case did the higher post-exposure noise 
level fail to generate a greater TTS 
than did the control. It should be noted 
that in a study concerned with pre- 
exposure residual effects (2) this same 
listener showed no residual effect.? 

Harris (/) in an earlier study has 
shown that a 0 db TTS may not repre- 
sent a complete recovery of the audi- 
tory system. By implication, the present 
study confirms Harris’ conclusion and 
further demonstrates that under special 
conditions, low noise levels may have 
a residual effect on the hearing system. 

It is concluded that the question 
asked in this study may be answered 
affirmatively; namely, that sensitizing 
post-exposure residual effects may be 
demonstrated. 


*In a paper concerned with pre-exposure ef- 
fects, Ward (4) has reported, using young 
adults as listeners, negative results. However, 
when he ran himself under the same con- 
ditions the results were positive. Dr. James 
M. Pickett has suggested that perhaps age is 
an important factor for demonstrating resi- 
dual effects. While a precise statement can- 
not be made as to the age of the listeners in 
this study, it appears that the older listeners 
exhibit a relatively greater residual effect than 
do the younger. 


Summary 


This study sought to answer the 
question of whether a sound, in itself 
insufficient to produce a temporary 
threshold shift (TTS), can modify 
the recovery threshold to a more in- 
tense sound. TTS measurements were 
obtained following two conditions of 
high-level noise exposure: (a) a control 
condition in which a period of silence 
followed a broad-band noise exposure; 
(b) an experimental condition in which 
the same exposure was followed by 
noise at levels which alone produce no 
measurable TTS. A 3-min exposure to 
a noise level of 120 db SPL was fol- 
lowed in the control condition by a 
3-min exposure to silence and in the 
experimental condition by a 3-min ex- 
posure to noise of 70 and 80 db, noise 
which alone produced no measurable 
TTS. Statistical analysis demonstrated 
highly significant differences between 
the experimental condition (80 db) and 
the control. The initial question is in- 
terpreted affirmatively. 
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Factors Related to Lipreading 


AUDREY ANN SIMMONS 


Some hearing-impaired individuals learn 
to read lips very well while others do 
not acquire very much skill even after 
long periods of training. The present 
study is concerned with: (a) the meas- 
urement of the ability to lipread and 
(b) the factors in the individual that 
may be related to the ease or difficulty 
with which he reads lips. It is not con- 
cerned with various aspects of the edu- 
cational procedures themselves. 


Measurement of Lipreading 


One major obstacle to establishing a 
relation between different characteris- 
tics of an individual and his ability to 
read lips is the lack of a quantitative 
measure of lipreading ability. There 
have been various attempts to devise 
tests. In 1928 Day, Fusfeld, and Pintner 


(1) created a series of tests that were - 


given in a face-to-face situation but the 
variability of speaking situations and 
among speakers contributed to low re- 
liability. 

The advent of motion pictures was 
accompanied by a series of develop- 
ments in which both the test mate- 
rial and the speaker of the material 
could be held constant from trial to 
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trial. In 1932 Mason (11, 12) described 
the use of films as an aid to instruction 
and later these same films were used as 
a test of comprehension. Heider and 
Heider (7) and Reid (20) also pre- 
pared films for test purposes. 

An effort at standardization of a 
film test on a population of 761 people 
was made by Utley (23, 24). Utilizing 
the Utley test, Heider (9) found it un- 
necessarily long for children but she 
indicated that it was in general a good 
measuring device. DiCarlo and Kataja 
(2) found the Utley film too long even 
with adults but they also found a high 
correlation (r = .77) between the re- 
sults of the Utley film and those of 
another film that had been designed for 
teaching. They also reported that hear- 
ing-impaired subjects gave lower scores 
than normal subjects not trained in 
lipreading, and this result was partially 
corroborated by O’Neill’s (15) demon- 
stration that normally hearing indi- 
viduals utilize lipreading considerably 
in comprehending speech. 

There appears to be no doubt that 
a filmed test of lipreading can test re- 
liably some aspect of lipreading, but 
many therapists seem to feel that the 
ability tested by films is not the same 
as the one that they would like to test 
—the ability to read lips in the every- 
day face-to-face situation. 

The present study represents an at- 
tempt to create a more valid measure 
than is now available of ordinary, 
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everyday lipreading through the use of 
an interview carried out in a face-to- 
face situation. Various scoring proce- 
dures, to be described below, yield a 
measure of lipreading derived from this 
interview situation, and this measure is 
used as an external criterion with which 
other tests can be correlated. 

The Interview Criterion. Five judges, 
who served also as speakers, were se- 
lected to carry out and quantify the 
interviews. All five were therapists with 
two or more years of experience with 
hard-of-hearing adults. 

Rating Scale. The judges were first 
asked to describe levels of ability in 
lipreading in such a way that a scale 
could be created. The following 
emerged as descriptive criteria under 
three general levels of lipreading abil- 
ity: 

Good: does not often miss what is 
said; lipreads without hesitation; 
follows shift in topic; can follow 
direct (to him) and indirect (be- 
tween and among others) conver- 
sation; can lipread profiles and full 
face; can lipread variety of speak- 
ers; needs no special distinctness 
on part ot speaker. 

Average: sometimes hesitates and re- 
quires a filling in; eventually gets 
what is said either through repe- 
tition or rewording without an ex- 
treme breaking down of what was 
said; lipreads some people better 
than others; has some difficulty in 
following change in topic; has 
some difficulty with individual 
conversation; has some ease with 
speech if it is distinct; requires a 
simplification of language on part 
of speaker. 

Poor: . requires frequent repetition; 
requires written clues; has only 


limited range of topic; can’t fol- 
low even when speech is re- 
worded; can’t pick up speech from 
context; needs to have lip move- 
ments exaggerated; needs slow 
rate; needs simple language. 

Use of the Rating Scale. The five 
judges together engaged each of 24 
hard-of-hearing subjects in conversa- 
tion. The order in which the judges 
entered the conversation remained con- 
stant. Topics common to all interviews 
were: weather, television, news, sports, 
hearing aids, children, and community. 
The interview was carried out with 
speakers talking at a level just below the 
threshold level that was required by 
each patient to recognize some speech 
through hearing alone. 

After each interview the judges re- 
viewed the conversation together and 
then individually evaluated the ability 
of the subject on the scale that they 
had created. For this purpose a graphic 
evaluation was indicated on a scale laid 
out on a 10-cm line with points labeled 
good, average, and poor, marked at dis- 
tances 2.5, 5.0, and 7.5 cm from the 
left end of the line. Numerical scores 
were thus obtained in centimeter units. 

An analysis of variance of the judges’ 
scores resulted in an estimate of the 
reliability coefficient of .92, a figure 
that indicates satisfactory reliability of 
the mean ratings used as criterion meas- 
ures. 

Correlation Between Criterion and 
Filmed Tests. Two silent movie tests 
were used, one by Utley (23, 24) and 
one by Mason (11). The Utley film 
(‘How Well Can You Read Lips?’) 
contains three kinds of material: Part 
I, a sentence test of 31 items including 
short phrases and full sentences; Part 
II, a word test of 36 words; and Part 
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Taste 1. Scores on the tests of lipreading. 
(Numbers in parentheses on Utley test scores 
are values obtained by Utley.) 











Mean S. D. 
Utley 
Part I 34.28 (33.63) 13.34 (16.36) 
Part II 5.67 ( 6.85)* 2.40 ( 3.80) 
Part III 5.58 ( 5.62) 5.17 ( 4.48) 
Total 45.53 16.64 
Mason 40.50 22.59 
Interview 4.71 3.22 








* Significant at the 5% level. 


III, a comprehension test in which are 
shown six short scenes involving one 
or more speakers. Following each story 
in this third part is a series of five ques- 
tions, after each of which the subject 
writes an answer of a sentence or word. 
The first two parts are in black and 
white, the stories are in color. The 
speaker in the first two parts is a young 
lady, while both males and females are 
involved in Part III. One point is given 
for each word correct in Parts I and 
II, and one point is given for éach cor- 
rect answer in Part III. The Mason film, 
in black and white, shows a young lady 
speaking 27 sentences. The subject’s 
score is obtained by giving one point 
for each correct word in the sentences 
with a total possible score of 178 points. 


Taste 2. Intercorrelations between 
and filmed-test measures of lipreading. 


interview 











Variable Interview Mason Total Utley 
Film Score 
Part I Utley .52* 5* .99* 
Part II Utley .37 A ba .68* 
Part ITI Utley 261° .59* A fi 
Total Utley .57* .80* 
Mason Film -61* 








* Significant at the 1% level. 


Results. Table 1 shows the means and 
standard deviations of the measures for 
the 24 hard-of-hearing subjects ob- 
tained on each of the three tests used: 
the two films and the interviews. There 
was a significant difference (t = 2.36, 
significant at the 5% level) between re- 
sults obtained by Utley in her. study 
and the results of the present study on 
Part II of the Utley test. Except for 
this part (involving a word test) sub- 
jects in the present study on the aver- 
age appeared to perform according to 
previous norms established for the 
Utley filmed test. No norms were 
available for the Mason film and, there- 
fore, a similar comparison could not be 
made. 

Intercorrelations among these meas- 
ures of lipreading are shown in Table 
2. There is a significant correlation be- 
tween the results of the interview and 
the total score on the Utley and the 
Mason filmed tests. Both of these cor- 
relations, while not high, are significant 
at the 1% level. The filmed tests and 


‘the interview obviously do not measure 


exactly the same thing but these find- 
gest that what they do measure 
fair amount of ability that is 
common ta,the two kinds of tests. 

All three parts of the Utley test are 
significantly correlated with the total 
score, and Part I has an extremely high 
weighting in the total score, which is 
necessarily the case, as may be noted 
from the relative sizes of the standard 
deviations reported in Table 1. The 
parts and the total score are also sig- 
nificantly correlated with the score on 
the Mason filmed test. 

These intercorrelations may also be 
compared with previous reports of the 
reliability of these test instruments. 
Utley has reported a reliability coeffi- 
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cient for the sentence test (Part I) of 
.928, computed from the correlation 
between two forms of Part I. A simi- 
larly computed reliability coefficient 
for Part II is .797. The story test (Part 
III) apparently did not have two forms 
but is correlated with the other parts 
as follows: with Part I, .69; and with 
Part II, .56. 

Conclusions. No attempt was made to 
compute a validity coefficient for the 
scores derived from the interviews. The 
assumption was that the way in which 
the scale was derived and the way in 
which the interview was carried out 
reflect a valid judgment of a person’s 
lipreading ability. On this assumption 
the criterion of lipreading ability is de- 
fined as a score on a rating scale derived 
from the interview. The correlation 
between either filmed test and this cri- 
terion is a measure of the validity of the 
two filmed tests. While the correla- 
tions among parts of a filmed test or 
between two filmed tests are relatively 
high, this correlation between either 
one and the criterion is somewhat 
lower, although still significant. It ap- 
pears, therefore, that these filmed tests 
do measure something related to lip- 
reading ability but that the something 
is only a part of the general ability that 
can be scored in a face-to-face situa- 
tion. 

A somewhat different approach to 
this problem was reported by O’Neill 
and Stephens (17). This work, based 
upon the M.A. thesis (Ohio State Uni- 
versity, 1956) of Stephens, reports 
somewhat lower correlations, signifi- 
cant at the 5% level, between parts of 
the Utley Test and the Mason Test. 
Correlations between a teacher’s rating 
and the Utley Parts I and III were 
similar to the present results but again 


significant at the 5% level. There are 
several differences between the two 
studies. The 26 subjects used by O’Neill 
and Stephens were all taking formal 
lipreading instruction, whereas the sub- 
jects in the present study had had none. 
The O’Neill-Stephens group may have 
been more homogeneous because of this 
lipreading instruction, but no measures 
are reported that might yield such a 
fact. The rating in the O’Neill and 
Stephens study was made by only one 
teacher whereas the rating scores used 
here were based on the ratings of five 
judges, none of whom was the sub- 
ject’s teacher. Finally, O’Neill and 
Stephens use the low correlations be- 
tween filmed tests and the single- 
teacher rating to question the validity 
of the filmed test and the validity of the 
rating. On the contrary, it is suggested 
here that the criterion of validity is 
defined in terms of a good rating scale, 
and that the validity of a filmed test 
can be ascertained only in terms of its 
correlation with such a criterion. 


Factors Related to Lipreading 


There seems to be informal agree- 
ment among teachers of lipreading that 
their most successful students should 
have a high ability to synthesize mean- 
ingful wholes from fragmentary parts. 
This synthetic ability seems to be the 
only one on which there is relative 
unanimity of opinion. Much less agree- 
ment appears to be found among 
teachers with regard to other abilities. 

Previous Investigations. Many more 
attempts to find factors that seem to be 
highly correlated with lipreading abil- 
ity are to be found in the literature. 
These attempts have been focused on 
the individual’s age, intelligence, edu- 
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cation, language and reading ability, 
synthetic ability, and the perception of 
color and form. 


Age. Relating chronological age to 
lipreading ability, the Heiders (7) re- 
port a correlation coefficient of about 
19 for children 8 to 17 years. Reid 
(20) reports the same number for stu- 
dents 10 to 22 years. These studies are 
supported by Utley’s (22) conclusion 
that age is not an important variable in 
lipreading ability. 

Intelligence. For a sample of chil- 
dren in advanced classes in schools for 
the deaf, Pintner (18) found no sig- 
nificant correlation between lipreading 
and intelligence, as measured by his 
Non-Language Test. Reid (20) did not 
find a correlation with intelligence in a 
sample of children from another school. 
On the other hand, O’Neill (14) found 
a correlation coefficient of .55 between 
lipreading and intelligence as measured 
by the Wechsler-Bellevue Performance 
Scale for a sample of normally-hearing 
young adults. 


Education. Another negative find- 
ing of Reid (20) was the lack of corre- 
lation between lipreading and grade 
level or length of training at schaols for 
the deaf. Utley (22) also found no 
correlation. But Heider (9) reported 
a correlation coefficient of .51 between 
the lipreading ability of school-age deaf 
children, as measured by the Utley 
filmed test, and the number of years in 
school. The Heiders (7) earlier had 
found a correlation coefficient of .54 


between lipreading and educational. 


achievement. 


Language and Reading. Both lan- 
guage ability and ability for abstract 
thinking were claimed to be important 
for lipreading by Goldman (4) and by 


Utley (22), but O'Neill and Davidson 
(16) could find no correlation between 
reading ability and lipreading. 

Synthetic Ability. Not only teach- 
ers’ opinions but also some systematic 
results support the notion that synthetic 
ability is related to lipreading. With a 
tachistoscope, Kitson (10) determined 
a relation between synthetic and analy- 
tic perception and lipreading ability. 
A reading test and a completion test, 
presumed to be indicative of synthetic 
ability, yielded correlation coefficients 
of .67 and .65, respectively, with lip- 
reading ability, in a group of 15 hear- 
ing-handicapped subjects. From the 
results of a similar experiment, Gop- 
fert (5) concluded that synthetic abil- 
ity should be dominant in the success- 
ful lipreader. 

Lipreading Material. It appears that 
the perception of the phoneme plays a 
role in comprehension through lip- 
reading, but investigators are not in 
agreement and some findings are in 
direct conflict. The Heiders (7) found 


‘a higher correlation between the ability 


to perceive vowels and lipreading than 
between the ability to perceive conso- 
nants and lipreading. On the other 
hand, O’Neill (/4) reported that per- 
ception of the phoneme had the great- 
est effect on the identification of con- 
sonants and less on the recognition of 
vowels, words, and phrases. This 
difference might be attributed to the 
difference in population: the Heiders’ 
group was deaf children; O’Neill’s was 
hearing adults. Utley (22) concluded 
that stories served as better indices of 
performance than words or sentences. 
Nitchie (13) had said earlier that one 
of the most effective lipreading tests to 
determine the pupil’s skill was story 
work. For Goldman (4), the knowl- 
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edge of the verb of the sentence seemed 
to be the most important factor. 

Color and Form. Using a sorting 
task, the Heiders (8) found that good 
lipreaders sorted according to color 
while poor lipreaders preferred form. 
Using the Hanfmann-Kasanin Test, 
which requires the sorting of blocks 
according to height and size rather 
than form or color, O’Neill and David- 
son (16) found a significant relation. 
They attributed it to the fact that the 
test measured nonverbal concept for- 
mation. 

Other Factors. O’Neill (14) also 
explored personality adjustment as a 
factor relating to lipreading ability but 
found low and, in some instances, nega- 
tive correlations. He found this to be 
true also when he related ‘speech atti- 
tude’ to lipreading performance. 

Tactile reinforcement, the speech 
read by the eye and felt by the fingers, 
may be significant according to Gault 
(3). The Heiders (7) found perform- 
ance in rhythm in dancing and in gym- 
nastics related to lipreading ability 
(r = .54). Woolley (26) reported im- 
provement in lipreading ability when 
employing the tachistoscope as a train- 
ing device, but O’Neill and Davidson 
(16) reported no statistically signifi- 
cant relation between visual memory 
span and success in lipreading. 

A survey of the literature yields a 
varied list of factors that might be re- 
lated to lipreading but there is lack of 
agreement among investigators as to the 
degree of the relationship. There is also 
a wide variation in the population in- 
vestigated. Some were normal hearing 
subjects, some congenitally deaf chil- 
dren, and others deafened adults. There 
is a lack of consistency in the measures 
of lipreading ability. The latter in- 


cluded speaking with subjects, using 
teacher’s evaluations, and silent movie 
films. The fact that there are several 
such films available adds further to the 
possibility of differences. Synthetic 
ability and chronological age as signifi- 
cant factors relating to lipreading seem 
to be the only ones on which several 
investigators agree although their find- 
ings are inconclusive. Differences in 
correlation were found in intelligence, 
education, language, reading, percep- 
tion of color and form, and visual mem- 
ory. Factors such as personality, tactile 
reinforcement, and rhythm have not 
been investigated by more than one ex- 
perimenter. 

Factors Used in the Present Study. 
In searching for the most appropriate 
factors to be measured in the present 
study, leads from the past literature and 
from clinical experience, and also some 
mere guesses were used. General cate- 
gories investigated, in addition to age 
and characteristics of the hearing, 
were: intelligence, reading, concept 
formation, synthetic ability, and certain 
perceptual abilities in vision and audi- 
tory rhythm. The specific tests used 
for these factors were as follows: 

Intelligence. Form 1 of the Wechs- 
ler-Bellevue Scale (25) was used to 
measure intelligence. The complete bat- 
tery consists of 10 tests, the first five 
of which are described as ‘verbal,’ the 
last five as ‘nonverbal.’ They are: In- 
formation, 25 questions for range of 
information; Comprehension, 12 ques- 
tions on practical information and eval- 
uation of past experience; Similarities, 
12 pairs of nouns having a similar fac- 
tor; Vocabulary, defining of 42 words; 
Picture Arrangement, six pictures for 
sequence story; Picture Completion, 
discovery of a missing part; Digit Sym- 
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bols, timed symbol-to-symbol identifi- 
cation; Block Design, timed colored- 
block assembly for designs. (Arithmetic 
and Object Assembly given to obtain a 
Full Scale Score were not used for 
correlation in the study and are not 
described here.) 

Reading. The Iowa Silent Reading 
Test (6) consists of two somewhat di- 
verse types of prose: Rate of Reading, 
in which subject reads for given time 
and is checked; and Comprehension, 
consisting of three subtests. These sub- 
tests are sentence meaning, 50 ques- 
tions checking comprehension of sen- 
tence as a whole; paragraph compre- 
hension, 12 paragraphs checking ability 
to generalize and identify details; and 
key words, 20 questions requiring gen- 
eralization to a single word. 

Concept Formation. The Hanf- 
mann-Kasanin Test (19) is composed 
of 22 blocks of five different colors and 
four different shapes. These are to be 
sorted into categories. This test samples 
nonverbal concept formation. 

Synthetic Ability. Three measures 
were utilized to test for synthetic abil- 
ity. They are the Gestalt-Completion 
Test, 30 fragmented pictures of com- 
mon objects projected upon a screen; 
Thurstone Mutilated Words, 26 frag- 
mented printed one and two syllable 
words; Fragmentary Sentences, parts 
of ‘100 sentences requiring completion. 

Visual Memory Span. Two tests 
were used to measure memory span. 
They are: Object Span Test, 21 slides 
of two or more common objects, as 
pictured by Dolch, for retention of ob- 
ject names; Digit Span Test, 43 slides 
containing eight 3-digit, nine 4-digit, 
six 6-digit, ten 5-digit, and seven 7- 
digit slides, for retention of digits pre- 
sented visually. 


Rhythm. Seashore Test (21) is a 
test designed to measure the sense of 
rhythm of an individual. The subject 
listens in rapid succession to two re- 
corded rhythmic patterns, then indi- 
cates whether the patterns are the same 
or different. Form A is recommended 
for use with unselected groups, Form 
B for use where finer screening is de- 
sirable. One point is given each cor- 
rect response. 

Procedure. This study was de- 

signed to measure the possible relation 
between a wide variety of individual 
factors and lipreading ability. The sub- 
jects were 24 adults, 12 men and 12 
women, who had been seen in the hear- , 
ing clinics of Central Institute for the 
Deaf and for whom lipreading instruc- 
tion had been recommended. None of 
the subjects had*had any previous for- 
malized lipreading instruction and it 
was assumed that since this instruction 
was needed, there would be a relatively 
high motivation for lipreading. 
- After an initial hearing test, each of 
the 24 subjects was interviewed by the 
group of judges in a manner described 
in the first section of this paper in con- 
nection with the establishing of a cri- 
terion for lipreading. Following the in- 
terview and the judges’ rating of lip- 
reading ability, one of the silent film 
tests was shown to the subjects as a 
group. No lipreading instruction was 
given during the time of the study. 
After all of the other factors had been 
measured, the group was shown the 
other filmed test of lipreading. 

Each subject was seen twice by a 
psychometrist who gave, at the first 
visit, the Wechsler-Bellevue Test and, 
at the second, the Hanfmann-Kasanin 
Test. 
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TaBLE 3. Hearing and age compared with lipreading measures. The last three columns give Pearson rs 
estimating relations between lipreading variables and the hearing and age variables. 











Variable Mean Ss. D. Pearson rs 
Interview Mason Utley 
Age 47.13 9.64 -.19 —.25 —.22 
Duration of Loss 19.12 15.04 .51* .32 .29 
Pure-tone Hearing Loss 33.85 21.10 21 .38 27 
Discrimination Score 58.70 20.15 —.26 -.001 —.36 








* Significant at the 1% level. 


The stimulus items of the Gestalt 
Completion, the Object Span, and the 
Digit Span Tests were flashed upon a 
screen that was viewed by all subjects 
as a group. At first, a nontest picture 
was shown so that the subjects could 
seat themselves for comfortable view- 
ing and they all occupied the same seats 
in subsequent test sessions. A verbal 
signal, ‘Ready, look,’ preceded all test 
slides. The items of the Digit Span and 
the Object Span Tests were projected 
for 0.1 second. The Gestalt Completion 
Items were projected for 5 seconds. 


The Iowa Reading Test, the Frag- 
mentary Sentences, and the Mutilated 
Words were all paper-and-pencil tests 
and were administered to the subjects 
as a group. The standardized procedure 
for the lowa Test was used, a time limit 
of 15 minutes was allowed for the 
Fragmentary Sentences, and five min- 
utes were allowed for the Mutilated 
Words. 

Results. The results will be ordered 
according to the factors studied. 

Hearing and Age. The age and 
hearing characteristics of the sample 


TaBLE 4. Intelligence measures compared with lipreading measures. The last three columns give 
Pearson 1s estimating relations between lipreading variables and intelligence variables. 











Wechsler-Bellevue Mean s. BD. Pearson rs 

Interview Mason Utley 
Information LEST 2.7 -.05 .03 .07 
Comprehension 12.8 2.1 16 .06 24 
Similarities 12.9 2.6 —.35 -.24 -.16 
Vocabulary 12.1 2.6 —.05 -.03 .07 
Verbal Score 61.5 10.3 .005 -.09 -.07 
Picture Arrangement 9.9 3.1 .18 .46* .48* 
Digit Symbol 10.4 3.4 .42* .42* .50T 
Block Design TT 2.7 .44* 23 .43* 
Picture Comprehension 12.4 2.3 -08 —.04 .000 
Verbal IQ 119.4 11.6 -.008 -.11 -.08 
IQ Full Scale 121.7 9.5 25 13 21 








* Significant at the 5% level. 
} Significant at the 1% level. 
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Taste 5. Measures of reading skills compared with lipreading measures. The last three columns 
give Pearson 7s estimating relations between lipreading variables and variables in reading skills. 








Variable 





Mean S. D. Pearson rs 
Interview Mason Utley 
Rate 189.85 29.82 .02 33 .35 
Comprehension 150.67 23.86 .02 .08 .32 
Sentence Meaning 173.63 22.81 .02 33 .44* 
Paragraph Comp. 148.83 24.78 .03 31 38 
Key Words 153.42 16.92 04 .53T .58t 








* Significant at the 5% level. 
{ Significant at the 1% level. 


are shown in the first two columns of 
Table 3. The hearing loss is the average 
hearing loss at 500, 1000, and 2000 cps. 
With a mean hearing loss of only 33 
db and an SD of 21 db, it is clear that 
a few of the subjects had very mild 
hearing losses, particularly for the fre- 
quencies that were averaged. These 
mild losses, however, were usually ac- 
companied by sufficient discrimination 
loss that the entire group can be said 
to have been handicapped. The correla- 
tions between the four factors (age, 
duration of loss, amount of hearing 
loss, and discrimination score) and each 
of the three measures of lipreading abil- 
ity are shown in the last three columns 
of Table 3. None of these factors is 
significantly correlated with the results 
of the filmed tests of lipreading and 
only the duration of loss is significantly 
correlated with the lipreading criterion 
that resulted from the interview rating. 

Intelligence. Table 4 shows the re- 
lation between each of the subtests and 
subtotals of the Wechsler-Bellevue 
Test of intelligence and each of the 
three measures of lipreading ability. 
The bottom row shows the correlations 
between the full-scale IQ and lipread- 
ing ability. The lack of significant cor- 
relation between IQ and any measure 


of lipreading ability agrees with previ- 
ous findings on normally hearing sub- 
jects (16) and on hard-of-hearing sub- 
jects (18, 20). The significant relation 
that was found between the Wechsler- 
Bellevue and the Mason filmed test 
by O’Neili (/5) pertained to subjects 
with normal hearing. 

Among the subtests, the Digit Sym- 
bol is significantly correlated with all 
three measures of lipreading, the Pic- 
ture Arrangement Test with the two 
filmed tests of lipreading, and the Block 
Design Test with the Utley filmed 
test and with the interview criterion. 
Both the Digit Symbol and the Block 
Design Tests involve conceptualization 
of relations among parts without a 
verbal component. To be sure, picture 
comprehension and similarities are sub- 
tests that also require conceptualiza- 
tion, but these may be said to involve 
verbal concepts. 

Reading Skills. The five scores 
available from the Iowa Reading Test 
are shown in Table 5, along with their 
correlations with the lipreading tests. 
The ability to extract sentence mean- 
ing and key words appears to be re- 
lated to the ability that is tested by the 
filmed tests, but none of the factors 
in reading appears to be significantly 
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TaBLe 6. Measures of synthetic ability compared with lipreading measures. The last three columns 
give Pearson rs estimating relationships between lipreading variables and variables in synthetic ability. 











Variable Mean S. D. Pearson rs 
Interview Mason Utley 
Gestalt Picture Completion 15.29 6.93 .38 .25 .29 
Mutilated Word 94.17 4.75 .o2 .28 .24 
Fragmentary Sentence 85.77 11.96 .06 .44* -40* 








* Significant at the 5% level. 


correlated with the lipreading ability 
that is measured by the interview rat- 
ing. This finding is not in basic dis- 
agreement with previous work on read- 
ing. 

Concept Formation. The Hanf- 
mann-Kasanin Test is assumed to meas- 
ure the ability to formulate concepts 
or to abstract. It seemed reasonable to 
suppose that the ability tested would 
be related to lipreading, particularly 
since the relation had already been 
shown to be significant by O'Neill and 
Davidson (16). In the present study the 
correlation between the Hanfmann- 
Kasanin and the Interview (r = —.34), 
Mason film (r = —.36) and Utley 
(r = —.29) all proved to be negative 
and not significant. 

Synthetic Ability. Of the three 
tests that may be presumed to measure 
whatever is meant by the term syn- 


thetic ability (Table 6), the only one 
that appears to be significantly corre- 
lated with lipreading is the Fragmen- 
tary Sentences and here the correlation 
is significant only for the filmed tests. 
Insofar as these three tests may truly 
represent good measures of synthetic 
ability, one cannot conclude that syn- 
thetic ability is related to the general 
ability to lipread, as measured in the 
interview situation. There remains the 
possibility, of course, that synthetic 
ability is really necessary and that a 
good test of it is yet to be found. 
Visual Memory Span. The two 
measures of visual memory span used 
were concerned only with the ability 
to recognize quickly and remember a 
group of objects or a line of digits. 
According to the results shown in 
Table 7, the Object Span Test was 
significantly correlated with all meas- 


TaBLE 7. Measures of visual memory span compared with lipreading measures. The last three columns 
give Pearson rs estimating relationships between lipreading variables with measures of visual memory 











span. 
Variable Mean S. D. Pearson rs 
Interview Mason Uiley 
Object-Picture Span 4.52 .73 .50* -42t .57* 
Digit Span 5.40 94 04 38 








* Significant at the 1% level. 
{ Significant at the 5% level. 
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TaBLE 8. Measures of rhythm compared with lipreading measures. The last three columns give Pearson 
rs estimating relationships between lipreading variables and rhythm-test results. 











Test Mean S. D. Pearson rs 
Interview Mason Utley 
Seashore A 24.79 3.9 .ol ,47* .46* 
Seashore B 22.25 3.7 .03 .30 .29 








* Significant at the 5% level. 


ures of lipreading and at least this as- 
pect of visual perception appears to be 
an important factor. It is not at all clear 
why the visual span is related to lip- 
reading when objects are used, but not 
when digits are used. 

Rhythm. Assuming that the ryth- 
mic perception in the auditory system 
may be related to rhythmic perception 
in vision, it is reasonable to suppose 
that such ability would be important 
for capturing the rhythmic aspects of 
speech. The results in Table 8 show 
that the performance on the easier part 
of the Seashore Rhythm Test (Part A) 
is significantly correlated with the lip- 
reading ability that is measured by the 
filmed tests, but not the lipreading abil- 


ity measured in the interview. Another 
kind of rhythm, demonstrated in gym- 
nastic tasks, yielded a correlation co- 
efficient with lipreading ability of .54 
according to the Heiders (7). 

Conclusions. The correlations be- 
tween the results on the various tests 
which sampled abilities in seven realms 
and lipreading ability are to be found 
in Tables 3 through 8. Table 9 is a 
summary of those test results that are 
significantly correlated with one or 
more of the measures of lipreading 
ability. From the information in this 
summary table the following conclu- 
sions may be drawn: 

(a) A study of factors possibly re- 
lated to lipreading ability must be lim- 


TaBLE 9. Relations between measures of lipreading ability and those factors found to be most highly 











correlated. 
Factor Test Pearson rs 

Interview Mason Utley 
Hearing Duration of Loss Bs bg -48F 
: Sentence Meaning A4t 
Reading Key Words .53* “58* 
Picture Arrangement .46F - .48t 
Intelligence Digit Symbol A2t 427 .50* 
Block Design 447 A8T 
Synthetic Ability Fragmentary Sentences 44F 40T 
Visual Memory Object-Picture Span .50* .42t ocr 
Rhythm Seashore A ATT -46t 








* Significant at the 1% level. 
{ Significant at the 5% level. 
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ited to the kind of lipreading ability 
measured. In the present study, only 
two test results are significantly related 
to both kinds of lipreading measures 
(filmed test and interview): the Digit 
Symbol subtest of the Wechsler-Belle- 
vue Intelligence Scale, and the Object 
Picture Span, presumably one aspect of 
visual perception. These two tests ap- 
parently pertain to the ability to per- 
ceive rapidly and remember a number 
of items and the ability to associate 
symbols with symbols. 

_(b) Four factors appear to be re- 
lated to the kind of lipreading ability 
that emerges from an interview rating. 
Two of them have been mentioned in 
(a) above. The other two are the dura- 
tion of the hearing loss and the com- 
prehension of relations as measured by 
the Block Design subtest of the Wechs- 
ler-Bellevue Intelligence Scale. 

(c) Many of the factors appear to be 
related to the kind of Kpreading ability 
that is measured by the filmed tests. 
Those factors that are significantly cor- 
related with results of the filmed test 
appear to involve the ability to abstract 
meaning from printed material, the 
ability to perceive relations in a com- 
plex visual pattern, and the ability to 
reconstruct meaningful wholes from 
part stimuli that are impoverished in 
terms of spatial parts missing or in- 
sufficient time of presentation. 
Conclusive interpretation of these re- 
sults is difficult to make. It is reason- 
able to suppose that the longer a per- 
son has a hearing loss, the more he has 
come to depend upon information 
through lipreading, even though he has 
not had formal lipreading instruction. 
It is not certain whether other factors 
that are significantly related, such as 
the perception of spatial relations and 


the ability to perceive in spite of miss- 
ing parts or quick presentation, con- 
stitute general abilities. If they do, the 
possibility remains that these are not 
genetically-determined factors but 
rather the kind which can be improved 
with certain kinds of training. If one 
could, for example, improve a subject’s 
object-picture span, then he might be 
a better candidate for lipreading in- 
struction. On the other hand, a result 
from the Digit Symbol Test, presum- 
ably an aspect of biologically-deter- 
mined intelligence, cannot easily be 
thought to be improvable. 

Summary 

An analysis was made correlating 
data on various tests with data on lip- 
reading ability of 24 subjects with im- 
paired hearing. The first purpose of 
this study was to develop a valid meas- 
ure of lipreading ability through the 
use of a rating scale applied to the re- 
sults of an interview, to represent an 
real-life lipreading situation. The rat- 
ings of the five judges were in very | 
close agreement (r = .92) and they 
were significantly correlated with the 
scores obtained on the same subjects 
with two filmed tests of lipreading 
(Mason and Utley), but the correla- 
tions were not high. 

Psychological measures of concept 
formation, intelligence, visual percep- 
tion, reading, rhythmic perception, and 
synthetic ability, in addition to age, 
amount of hearing, and duration of 
hearing loss, were compared with meas- 
ures of lipreading ability. Several abil- 
ities showed significant correlation 
with lipreading as measured with a 
filmed test, but only four specific test 
results were significantly correlated 
with lipreading as measured by rating 
from an interview. 
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Intelligibility and Associated Physiological Factors 


of Cleft Palate Speakers 


JOANNE D. SUBTELNY 


J. DANIEL SUBTELNY 


Anatomic and physiologic studies of 
cleft palate subjects have yielded con- 
sistent evidence that the pathology in 
cleft palate speech results primarily 
from velopharyngeal inadequacy or 
dysfunction (J, 3, 8). The articulatory 
and quality characteristics of cleft 
palate speech have been discussed ex- 
tensively in speech literature (4, 5, 6, 
9). Despite these facts, there is a defi- 


nite need to specify the physiologic 
parameters of cleft palate speech de-? 


fects. This specification is important to 
an understanding of the influence of 
structural and functional deviations 
upon the intelligibility of speech 
sounds. Such an understanding would 
make it easier to determine which 
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sounds can or cannot be achieved or 
improved. These considerations should 
determine to a large extent the ultimate 
objectives of treatment, the therapeutic 
emphasis, and the techniques to be em- 
ployed. 

The basic purpose of this study was 
to examine specific articulatory errors 
in cleft palate speech in association with 
measurements of the velopharyngeal 


and dental relationships. The specific 


aims of this study are as follows: 

(a) To define, by intelligibility test- 
ing, articulatory errors in cleft 
palate speech. 

(b) To determine statistically 
whether articulatory errors in 
cleft palate speech can be cate- 
gorized phonetically. 

(c) To study relationships between 
specific types of articulatory 
errors and velopharyngeal meas- 
urements. 

(d) To study relationships between 
specific types of articulatory er- 
rors and measurements of dental 
relationships. 


Procedure 


The study of possible relationships 
between articulatory errors in cleft 
palate speech and physiologic factors 
made necessary the attainment of high- 
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fidelity tape recordings of an articula- 
tion test, x-ray registrations, and pho- 
netic evaluations of the tape recordings. 

Subjects, The subjects for this study 
were 27 cleft palate individuals ranging 
from 16 to 36 years of age. All subjects 
had surgically repaired clefts of the 
velum and hard palate. Nineteen sub- 
jects had complete unilateral clefts of 
the lip and palate, five had complete bi- 
lateral clefts of the lip and palate, and 
three had clefts of the hard and soft 
palates with no involvement of the 
alveolar process or lip. 

Radiographic records and high-fidel- 
ity tape recordings of an articulation 
test were accumulated for these sub- 
jects. 

Articulation Tests. The purpose of 
the articulation test was to determine 
whether or not subjects could produce 
identifiable consonants appearing in 
simplified and controlled phonetic en- 
vironments. For this reason nonsense 
syllables, which do not provide con- 
textual clues to the identity of the con- 
sonants, were used to test articulation. 

The test material consisted of 48 non- 
sense syllables which included an equal 
number of consonant-vowel and vowel- 
consonant combinations. Consonants 
were selected to represent the three 
major classifications - of consonants. 
Plosives [p] and [b], [t] and [d], [k] 
and [g]; fricatives [f] and [v], [s] and 
[z]; and glides [1] and [r] were com- 
bined with vowels [i], [u], [a], and 
[a]. Consonants were combined with 
each of the four vowels so that each 
consonant appeared twice in the initial 
and twice in the final position. For ex- 
ample, the consonant [p] appeared in 
syllables [pi], [pa], [ep], [up]. 

Each subject read the 48-syllable 
articulation test with five-second inter- 


vals between syllables. Tests were re- 
corded by use of the Magnecorder, 
Model PT-6, and the Electro-Voice, 
Model 650 microphone. To minimize 
practice effect in listening, the order 
in which syllables were recorded was 
randomized for each subject. 

The tapes were played back in a 
sound-treated room for seven speech 
correctionists who made phonetic tran- 
scriptions. These transcriptions were 
analyzed and the results averaged ac- 
cording to incidence of error and type 
of error. 


Physiological Factors. To determine 
the physiological factors associated 
with articulatory errors, measurements 
were made from tracings of midsagittal 
cephalometric laminagraphs.’ Films 
were taken under two conditions: with 
the teeth in occlusion and during pho- 
nation of [u]. Head stabilization was 
insured by a special head-holding de- 
vice. Tape recordings of the vowel 
phonation, obtained simultaneously 
with the radiographic exposure, were 
later played back for phonetic identifi- 
cation of the vowel produced. 

Two measurements were made from 
the films. The first, to quantify the de- 
gree of velopharyngeal constriction 
during sustained phonation of [u], was 
made at the point of closest approxi- 
mation between the posterior aspect of 
the velum and the anterior margin of 
the posterior pharyngeal wall. The 
second was obtained from films taken 
with the teeth in occlusion and was 
used to define the anteroposterior rela- 





*Laminagraphs are x rays obtained through 
the utilization of a body-sectioning radio- 
graphic technique. A specific lamina or layer 
of body structure can be visualized radio- 
graphically without the superimpositioning of 
structures on either side of the desired lamina. 
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tionships between the upper and lower 
incisors. 

In normal dental relationships, the 
lingual surfaces of the upper incisors 
overlap or contact the labial surfaces of 
the lower incisors. A reading of 0 mm 
indicated that a normal anteroposterior 
relationship of the incisors existed. 
Positive readings indicated a degree of 
overjet or a protruded position of the 
upper (maxillary) incisors in relation 
to the lower (mandibular) incisors. 
Similarly, minus readings indicated a 
retruded position of the upper incisors 
in relation to the lower incisors. This 
method was utilized as a descriptive 
index of anterior dental relationships 


and does not offer reliable information 


pertaining to possible lateral constric- 
tion in the maxilla. A buccolingual con- 
striction of the maxillary dental arch 
cannot be visualized in a midsagittal 
laminagraph. Despite this limitation, the 
evaluation of the relationship of the 
anterior teeth seems indicated. It has 
been found that in hypernasal cleft 
palate subjects, significant deviations in 
tongue posture were associated with 
severe retrusions of the maxilla (8). 
Maxillary retrusions would be mani- 
fested by a lingual positioning of the 
maxillary incisors relative to the man- 
dibular incisors. For this reason, the 


relationship of the anterior teeth was 
measured. 
Results 

Articulatory Errors. The average 
number of articulatory errors on the 
48-syllable test was found to be 11.03. 
For a better definition of the consonant 
types most commonly misarticulated, 
the average number of errors for each 
consonant pair was determined. These 
were as follows: [s] and [z], 2.79; [p] 
and [b], 2.17; [f] and [v], 1.80; [k] 
and [g], 1.72; [t] and [d], 1.42; and 
[1] and [r], 1.15. A decrease in the. 
average number of misarticulations is 
indicated by the order reported, but 
only the mean for the pair [s] and [z] 
was significantly different from the 
means of the other pairs. When con- 
sonant pairs were grouped according to 
phonetic classifications, an obtained t 
of 2.32 revealed that the incidence of 
fricative errors was significantly greater 
than the incidence of plosive errors. 

Correlation coefficients (Table 1) 
were obtained to determine whether or 
not misarticulations on one pair of con- 
sonants were related significantly to 
misarticulations on other consonant 
pairs, and to determine whether or not 
a pattern of articulatory errors could 
be established. Such a pattern might be 
studied for some common physiologic 


TaBLE 1. Correlation coefficients estimating relationships between different types of articulatory 


errors of adult cleft palate speakers. 











{f], [v] {t], [d] [s], [z} fl], fr] [k], [g] 
{p], [b] 0.20 0.32 0.18 0.17 0.62* 
lf], [v] .A9t 522 0.42t 0.44 
[t], [d] .50* 0.36 0.50* 
[s], [2] 0.15 0.17 
{], tr] , 0.20 








* Significant beyond 1% level. 
{ Significant beyond 5% level. 
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factor associated with the specific ar- 
ticulatory defects in cleft palate speech. 
Consonant pairs were not, in general, 
found to be related to other consonant 
pairs on frequencies of errors. Excep- 
tions to this were the significant corre- 
lations between plosives. As shown in 
Table 1, correlations significant be- 
yond the 1% level were found between 
labial plosives and lingua-velar plosives; 
between lingua-alveolar plosives and 
lingua-velar plosives; between lingua- 
alveolar plosives and _lingua-alveolar 
frictatives. Since these three groups of 
plosive consonants have differing posi- 
tions of articulations, the suggestion 
that articulatory errors show similarity 
in position of articulation is not sup- 
ported. However, the significant inter- 
relationships indicate that there was 
some consistency of frequency of ar- 
ticulatory error during the production 
of sounds with abrupt onset character- 
istics. 

No significant relationship was found 
between errors on [s] and [z] and 
errors on [f] and [v] even though 
these two fricative pairs are similar in 
manner of production, both being sus- 
tained sounds produced by forcing the 
breath stream through constricted areas 
within the vocal tract. It was also not 
found that glide errors were related to 
errors on any of the other consonant 
pairs, if the 1% level of confidence is 
used as a criterion. 

Physiologic Factors. Dental and velo- 
pharyngeal measurements revealed that 
marked individual variation existed in 
both areas. Anterior dental relationships 
were found to vary from a slight over- 
jet of +3.00 mm to severe maxillary 
retrusions of —11.00 mm. If significance 
can be attached to a mean of —2.26 
mm, it can be stated that this sample of 


adult cleft palate subjects did not show 
severely disturbed anteroposterior rela- 
tionships of the incisors. 

Velopharyngeal measurements ranged 
from complete closure to a marked 
openness of 16 mm. The mean for velo- 
pharyngeal opening during phonation 
was 5.41 mm. Statistical correlation of 
velopharyngeal and dental measure- 
ments showed these two measurements 
were not related significantly. 

Total Articulatory Errors, Velo- 
pharyngeal Measurements. On the basis | 
of total number of articulatory errors, | 
subjects with the highest degree of , 
articulatory proficiency were those; 
whose associated velopharyngeal rela-\ 
tionships closely approximated the nor- \ 
mal. Ten subjects were found to have~ 
relatively good articulation. In eight of 
these 10 subjects velopharyngeal open- 
ing did not exceed 1.5 mm. This finding 
tends to support the premise that articu- 
latory proficiency is associated with 
near normal velopharyngeal function. 

While a positive relationship be- 


“tween good velopharyngeal function 


and proficiency in total articulation was 
indicated, the results in regard to total 
articulation scores for the entire sample 
do not permit the conclusion that there 
is a well defined relationship between 
total articulatory proficiency and velo- 
pharyngeal function. The number of 
total errors, as determined by the non- 
sense-syliable test utilized, did not in- 
crease linearly as the degree of velo- 
pharyngeal opening increased. Some 
subjects were able to achieve far better 
total articulatory scores than others 
with essentially the same degree of 
velopharyngeal opening. In short, 
marked variation in total intelligibility 
scores for specified degrees of velo- 
pharyngeal opening was revealed. 








Cleft Palates, 


TaBLe 2. Correlation coefficients estimating re- 
lationships between specific articulatory errors 
and measurements of dental relationships, and 
between specific articulatory errors and measure- 
ments of velopharyngeal openings in adult cleft 
palate speakers. 











Consonants Dental Velopharyngeal 
Tested Measurements Measurements 
r r 
[p], [b] -0.14 0.56* 
[f], [v] 0.03 0.36 
[t], [d] -0.27 0.36 
[s], [z] -0.06 -0.03 
(, [r] 0.10 0.03 
[k], [g] -0.31 0.69* 
Dental -0.33 
Measurements 








* Significant beyond 1% level. 


Specific Articulatory Errors, Velo- 
pharyngeal Measurements. Since glides, 
plosives, and fricatives have essentially 
different methods of production, there 
seems a possibility that there may be 
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also a difference in their intelligibility 
when produced by subjects with cleft 
palate aberrations. Considering this 
possibility and the clinical importance 
of determining which consonants are 
most consistently defective with associ- 
ated velopharyngeal inadequacy, study 
of the correlations between specific ..’ 
articulatory errors and velopharyngeal 
measurements was indicated. These 
correlations are presented in Table 2. 


As a group, plosive errors were 
found to be consistently related to | 
velopharyngeal inadequacy. Correla- 
tions, significant beyond the 1% level, 
in spite of the restricted range for ar- 
ticulation scores, were found between 
errors on labial plosives and velo- 
pharyngeal openings (.56); and _ be- 
tween errors on lingua-velar plosives 
and velopharyngeal openings (.69). 


As indicated in Figure 1, both labial 
and velar errors increased appreciably 
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Ficure 1, Incidence of lingua-velar and labial 
measurements for each of 27 adult cleft palate 


plosive errors with associated velopharyngeal 
speakers. 
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when the velopharyngeal opening 
equalled or exceeded 5 to 7 mm. Rela- 
tively few velar and labial errors were 
found when closure was not complete 
Aout within 2 mm of being complete. On 
the basis of these data, it would seem 
that complete velopharyngeal closure 
is not essential for production of 
plosives but that intelligible plosives 
frequently are not produced when 
velopharyngeal openings exceed 5 to 
: 7 mm. 

No significant relationships were 

found between fricative errors and de- 

reree of velopharyngeal openings. Frica- 
tive errors, abundant regardless of 
differences in velopharyngeal openings, 
were found to be almost twice as com- 
mon as plosive errors in speech pro- 

duced by subjects with small velo- 

“pharyngeal openings (within 2 mm of 
complete closure). Cleft palate sub- 
jects, with essentially the same small 
degree of velopharyngeal opening, 
were better able to produce identifiable 
plosives than fricatives. 

As shown in Table 2, no significant 
relationship was found between glide 
errors and velopharyngeal openings. 

Articulatory Errors and Dental 
Measurements. The total number of ar- 
ticulatory errors was not found to be 
consistently or significantly related to 
measurements made between upper and 


lower incisors, To illustrate this, the . 


five subjects with the highest number 
of articulatory errors were found to 
have dental relationships ranging from 
an overjet of +3.00 mm to a maxillary 
retrusion of —5.0 mm, and the five sub- 
jects with the fewest articulatory errors 
had dental relationships ranging from 
an overjet of +2.5 mm to a retrusion of 
—5.00 mm. No significant correlations 
were found between defectiveness in 


any of the consonant pairs studied and 
the degree of anterior malocclusion 
(Table 2). 


Discussion 


Recently research attention has been 
focused upon organized and detailed 
investigations of the articulatory de- 
ficits of cleft palate children (2, 5, 7) 
and adults (3, 4, 9). The incidence of 
articulatory errors reported in the pres- 
ent study is essentially in agreement 
with the results of other investigators 
who have studied a group of cleft 
palate speakers of comparable age (4, 
9). Specifically, this applies to the high 
incidence of sibilant defects and the 
comparatively low incidence of glide 
defects. 


The present study of articulatory de- 
fects with accompanying physiologic 
data has yielded results which suggest 


that the.types or the phonetic charac- 





teristics of articulatory errors in cleft 

eech are influenced by the e de- 
gree_of velopharyngeal opening. For 
example, the incidence of fricative 
errors was considerably higher than the 
incidence of plosive or glide errors in 








| the speech of cleft palate subjects with 


small velopharyngeal openings (2 mm 
or less), Subjects with large velo- 


pharyngeal openings also had a high 


incidence of fricative errors but these 
subjects tended to have also a com- 
parably high incidence of plosive 
errors. 


Stated in other terms, present results ) 
suggest that a specific degree of velo- 
pharyngeal opening has an influence 
upon consonant intelligibility which 
may vary as a function of phonetic 
classification. At the present time, there 
seems little research evidence to sup- 
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port this suggestion. There is evidence, 
however, that velopharyngeal opening 
in cleft palate speakers has an influence 
upon nasality, or the degree of distor- 
tion, which varies as a function of the 
specific vowel produced (6, 8, 9). Be- 
cause nasality has been found to vary 
as a function of the vowel, it has been 
stated that certain vowels have greater 
predictive value than others in estimat- 
ing total nasality in connected speech 
(8, 9). 


Present results, obtained from a™~ 


sample limited to 27 cleft palate sub- 
jects, failed to reveal a direct relation- 
ship between velopharyngeal opening 
and total speech intelligibility. The 
marked variation in total intelligibility 
scores for specified degrees of velo- 
pharyngeal opening does not make it 
possible to predict total speech intelli- 
gibility from an associated measure of 
the degree of velopharyngeal opening. 
However, the significant correlations 
between plosive intelligibility and velo- 
pharyngeal opening suggest that the 
specific evaluation of plosive intelligi- 
bility has value in predicting velo- 
pharyngeal adequacy from the results 
of speech analyses. 


Compositely, present results strongly 
indicate the need for continued study 
of correlation between speech per- 
formance and associated physiological 
factors of cleft palate speakers. Clinical 
implications derived from these data 
should be considered tentative until 
the results of a comprehensive study of 
a larger cleft palate sample are pro- 


vided. 
Summary 


Cephalometric laminagraphs and tape 
recordings of a nonsense-syllable artic- 
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ulation test were obtained from 27 
adult cleft palate speakers. Lamina- 
graphs were traced and analyzed to 
quantitate velopharyngeal and dental 
relationships. Tape recordings were 
played back to a panel of listeners for 
phonetic transcription. The results 
were analyzed and then correlated with 
the physiologic measurements. 


Statistical analysis of consonant er- 
rors revealed the incidence of fricative 
errors was significantly higher than the 
‘incidence of plosive or glide errors. 
Intercorrelations of errors on plosive 
consonants were found to be signifi- 
cant. Correlations between measure- 


Po ? 
ments of velopharyngeal opening and 


plosive intelligibility also were found to 
be significant. On the basis of these find- 
ings, the specific evaluation of plosive 
intelligibility is suggested as useful in 
evaluating adequacy of velopharyngeal 
function. Since plosive errors were 
interrelated, irrespective of differing 
positions of articulation, the plosive 
manner of production is interpreted as 
a consistent attribute of defectiveness 
in cleft palate speech when an appreci- 
lable degree of velopharyngeal opening 
lexists. 
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Effect of Loudness Recruitment 
on Delayed Speech Feedback 


EARL R. HARFORD 


JAMES F. JERGER 


There is presently little experimental 
evidence to show that hearing-impaired 
subjects behave like normal subjects at 
identical sensation levels of delayed 
speech feedback. The results of two 
investigations (1, 2) suggest that hear- 
ing-impaired individuals with recruit- 
ment will show greater effects on de- 
layed speech feedback than normal 
listeners, possibly because of the pres- 
ence of recruitment. Indeed, theoretical 
considerations concerning the charac- 
teristics of loudness recruitment in 
pathological ears suggest that individ- 
uals manifesting this type of hearing 
impairment may show reactions to de- 
layed speech feedback at lower sensa- 
tion levels than either normals or hear- 
ing-impaired persons without recruit- 
ment. Thus, it may be fallacious to 
expect that all individuals with hearing 
impairment will behave like persons 
with normal hearing under exposure to 
amplified speech feedback. 
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Procedure 


Experimental Design. In view of the 
above considerations an experiment was 
designed to evaluate the possible con- 
tribution of loudness recruitment to the 
delayed speech feedback effect. 

The basic design involved the com- 
parison of delayed speech feedback be- 
havior in three groups: (a) 10 hearing- 
impaired subjects with recruitment, 
(b) 10 hearing-impaired subjects with- 
out recruitment, and (c) 10 normal 
listeners. 

The selection of subjects for the first 
group, hearing-impaired with recruit- 
ment, presented a problem. In order to 
afford a fair evaluation of delayed 
speech feedback, it was deemed neces- 
sary to limit all audiometric contours 
to those that conformed to a pattern of 
relatively flat and bilaterally symmetri- 
cal loss. This selection criterion pre- 
cluded the use of any direct test of 
loudness recruitment as a basis for as- 
signing subjects to the first group. It 
was decided, therefore, to include in 
this group only subjects whose hearing 
loss had been medically diagnosed as 
due to labyrinthine hydrops. Since the 
literature overwhelmingly indicates that 
loudness recruitment is a concomitant 
of this etiology, it was simply assumed 
that all subjects in this group had loud- 
ness recruitment, by virtue of the medi- 
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cal diagnosis. Selection of subjects for 
the second group, hearing-impaired 
without recruitment, seemed to be a 
less serious problem. This group was 
composed entirely of subjects with 
bilateral, medically-diagnosed clinical 
otosclerosis. The literature clearly indi- 
cates that conductive loss of this type 
is not accompanied by loudness recruit- 
ment, For the third group, normal lis- 
teners, young adult college students 
were employed. 

Ideally, a comparison of delayed 
speech feedback behavior in these three 
groups would be sufficient. In reality, 
however, the problem is complicated 
by the fact that hearing-impaired sub- 
jects with recruitment, especially those 
with labyrinthine hydrops, tend to 
differ from hearing-impaired subjects 
without recruitment with respect to 
their ability to understand speech, a 
variable which could, conceivably, af- 
fect performance under delayed speech 
feedback independent of the presence 
or absence of recruitment. 

In order to assess the effects of these 
two variables, recruitment and speech 
discrimination loss, separately, the pres- 
ent design included a fourth group, 10 
hearing-impaired subjects with recruit- 
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ment but without speech discrimination 
loss. This condition was achieved by 
creating an artificial hearing loss in 
normal listeners through the use of 
binaurally presented broad-band ther- 
mal masking noise. Again, the literature 
clearly indicates the presence of be- 
havior like recruitment accompanying 
the hearing loss due to a masking noise. 

Finally, during the progress of the 
experiment, it became necessary to add 
a fifth group, 10 subjects with hearing 
impairment created by artificially ob- 
structing the external canals. The need 
for this fifth group became apparent 
as results obtained on the otosclerotic 
subjects were analyzed. It was antici- 
pated that, by virtue of the nature of 
the hearing loss, otosclerotics would 
behave like normal listeners at com- 
parable feedback sensation levels. It was 
observed, with some _ consternation, 
however, that the otosclerotics showed 
more marked reaction to feedback than 
either the labyrinthine hydrops or 
masked normal groups. Reflection on 
this unexpected difference between 
otosclerotics and normals brought two 
possible explanations. First, the effect 
might simply be due to the age differ- 
ence between the two groups. The 


TaBLe 1. Summary of sex distribution, mean age in years, mean air conduction (AC) and bone con- 
duction (BC) loss in db (500-2000 cps average), mean binaural spondee threshold loss in db, and mean 
binaural speech discrimination (PB) score, for each of five experimental groups. 











Group N Age Loss PB Score 
Male Female AC BC Spondee 
Normal 1 9 19.7 —* —* -1.6 96.6% 
Hydrops 6 4 45.9 40.4 37.8 44.0 78.8% 
Masked Normal 2 8 20.5 —* —* 42.2 95.2% 
Otosclerotic 4 6 45.4 38.4 9.1 41.4 96.8% 
Plugged Normal 3 7 42.9 —* —* 27.4 96.8% 








* No pure-tone threshold tests administered. Screened at +10 db. 


+ Tested for speech discrimination with ear canals unplugged. 
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mean age for normals was 19.7 years, 
and the mean age for otosclerotics was 
45.4 years. Perhaps, it was thought, 
susceptibility to the effects of delayed 
sidetone is much less in young than in 
middle-aged adults, irrespective of the 
presence or absence of a conductive 
hearing loss. The second possibility was 
that the discrepancy between normals 
and otosclerotics might simply be due 
to the presence of a hearing loss, per 
se, with or without recruitment. It 
seemed possible that any kind of hear- 
ing loss at all, conductive or sensori- 
neural, has the effect of increasing the 
delayed speech feedback effect. A fifth 
group of 10 subjects was added to rule 
out both of these possibilities simul- 
taneously. To this end, middle-aged 
normal-hearing adults, purposely 
chosen so that their mean age would 
roughly approximate that of the oto- 
sclerotic group, were tested both with 
and without wax-impregnated cotton 
plugs in the external ear canals. Both 
external canals were plugged in order 
to produce an artificial hearing loss not 
accompanied by recruitment. This 
group was also tested without the ex- 
ternal canals plugged in order to isolate 
the possible age effect from the possible 
hearing loss effect. In other words, if 
the behavior observed in the oto- 
sclerotics were due simply to their age, 
the effect would be expected in this 
fifth group of middle-aged normals 
whether their external canals were 
plugged or not. If, on the other hand, 
the effect observed in the otosclerotics 
were due simply to the effect of a hear- 
ing loss, per se, it would be expected to 
appear in the fifth group only under 
the plugged condition. 

For each subject in each group, the 
delayed speech feedback effect was 


measured at feedback sensation levels 
of 0, 10, 20, 30, 40, and 50 db. The 
specific method used to quantify the 
feedback effect was the flashing-num- 
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Ficure 1. Mean pure-tone and as losses 
and mean discrimination score for hydrops 


group. 
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Ficure 2. Mean pure-tone and speech losses 
and mean discrimination score for otoscler- 
osis group. 








364 Journal of Speech and Hearing Research 


bers technique first described by Butler 
and Galloway.’ 


Subjects. Each experimental group 
contained 10 subjects, carefully selected 
on the basis of medical diagnosis? or 
audiological findings or both. For each 
of the five groups, the sex distribution, 
mean age, mean binaural spondee thresh- 
old losses, and mean binaural discrimi- 
nation scores were determined, and for 
the hydrops and otosclerotic groups, 
the mean best binaural pure-tone aver- 
age hearing losses for the three speech 
frequencies (500, 1000, and 2000 cps) 
by both air and bone conduction (see 
Table 1). The mean pure-tone thresh- 
old hearing levels for the labyrinthine- 
hydrops and otosclerotic groups are 
presented in Figures 1 and 2, respec- 
tively. 

Apparatus. A standard pure-tone 
audiometer was used to obtain the hear- 
ing level for each hearing-impaired sub- 
ject and to screen each subject with 
normal hearing. A commercial speech 
audiometer (Grason-Stadler, Model 
162) was used for all speech testing. 
The spondee threshold was established 
with CID revised recorded spondaic 
word lists (W-1). Speech discrimina- 
tion was tested using CID revised re- 
corded phonetically-balanced word lists 
(W-22). Periodic sound pressure meas- 
urements were made over the testing 


*R. A. Butler and T. F. Galloway in a pa- 
per read at the American Speech and Hear- 
ing Association convention, Cincinnati, 1957, 
entitled ‘A comparison of the performance 
of normal hearing and hard of hearing per- 
sons on the delayed speech feedback task.’ 


*All subjects in the hydrops and otosclerosis 
groups were selected from the files of the De- 
partment of Otolaryngology, Northwestern 
University, and from the files of three oto- 
logists in private practice in Chicago. 
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Ficure 3. Simplified block diagram of the ex- 
perimental apparatus. 


period to ensure the stability of the 
signal delivered by the earphones. 
Figure 3 shows a simplified block 
diagram of the apparatus used for the 
delayed speech feedback testing. The 
path of the circuit originated from a 
message spoken into a microphone 
(American, type B-22) fed into a mag- 
netic tape recorder (Magnecord, Model 
PT6-AH) with fixed record and play- 
back heads. The delay period between 
the recorded message and the playback 
message was .167 sec. The output from 


‘the tape recorder was fed into the 


speech audiometer, then through a man- 
ual switch, finally to a set of earphones 
(Telephonic, type TDH-39-10 Z) 
mounted in MX41/AR cushions. A 
broad-band thermal noise, incorporated 
in the speech audiometer, was used for 
the masked-normal group. The acoustic 
spectrum of the noise signal was essen- 
tially determined by the frequency re- 
sponse characteristic of the earphone. 
Both speech and noise signals were al- 
ways delivered binaurally-in-phase. The 
speech audiometer was used to control 
the intensity of the delayed speech 
feedback and, when necessary, to de- 
liver thermal noise to the earphones. 
Isolated from the delayed speech 
feedback apparatus was the flashing- 
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numbers equipment. A multi-vibrator 
triggered a telephone-type stepping 
relay at a rate of two steps per second. 
The relay, in turn, completed a circuit 
at each of 80 steps, briefly illuminating 
one of five two-digit numbers on a 
panel located 4.5 feet in front of the 
subject’s chair and in full view of the 
subject. These numbers, from the sub- 
ject’s left to right, were 24, 31, 58, 63, 
and 82, each number being three inches 
high and centered directly above a six- 
volt amber light bulb. When a light 
flashed, the speaker was to say the 
number above it. The randomization of 
the stimulus numbers, therefore, was 
achieved by randomization of the order 
in which the lights were flashed. On a 
monitoring panel in full view of the 
tester, but not seen by the subject, 
identical numbers were illuminated 
simultaneously. 

Except for the earphones, micro- 
phone, and subject’s stimulus panel, all 
testing equipment was located in the 
control room, isolated from the test 
chamber. 

Testing. The over-all testing se- 
quence for each of the 50 subjects in 
the present investigation consisted of 
three main procedures: (a) a brief in- 
terview and pure-tone testing; (b) 
speech audiometric testing; and, (c) 
delayed speech feedback testing. 

Each subject in the normal group, 
masked-normal group, and the plugged- 
normal group was screened at a hearing 
level of 10 db, across the frequency 
range from 125 through 8000 cps, and 
including 3000 cps, on both ears to en- 
sure normal hearing. The subjects with 
medically-diagnosed labyrinthine hy- 
drops and otosclerosis were given con- 
ventional pure-tone audiometric thresh- 
old tests, using an ascending technique. 


Following the pure-tone testing, the 
spondee threshold was established using 
a similar ascending method. Immedi- 
ately after the determination of the 
spondee threshold, phonetically bal- 
anced words (W-22 recording series) 
were presented at a sensation “ne of 
30 db re spondee threshold. Discrimina- 
tion scores were obtained for subjects 
in the masked-normal group both in 
quiet and during exposure to broad- 
ban¢ thermal noise. 

The third and final test procedure 
consisted of the flashing-numbers de- 
layed-speech-feedback testing tech- 
nique at levels of 0, 10, 20, 30, 40, and 
50 db above the individual subject’s 
spondee threshold. For each subject in 
the masked-normal and plugged-normal 
groups, the spondee threshold obtained 
during the artificially-produced-hear- 
ing-loss condition was used as the base 
of reference from which sensation level 
was computed. 

For the actual test procedure for 
each subject, a microphone, mounted 
on a ‘third hand,’ was placed at a dis- 
tance approximately six inches from 
the speaker’s mouth. The manual 
switch controlling the signal to the ear- 
phones was placed in the off position 
and the subject was presented with a 
practice series of 40 flashes of lights on 
his panel to which he responded by 
saying the numbers as the lights flashed. 
Following the practice run, the actual 
delayed speech feedback test began. 

During each experimental series, the 
tester continuously adjusted the record | 
gain on the tape recorder to keep the 
VU meter needle on the speech audi- 
ometer at a predetermined level. This 
maintained relatively constant input to 
the subject’s earphones in spite of 
changes in vocal intensity resulting 
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Taste 2. Summary of median error scores and ranges of individual error scores for each of five groups 


at six sensation levels of delayed speech feedback. 














Group Sensation Level in db 

0 10 20 30 40 50 
ine Median 4.5 5.0 6.5 9.0 10.5 17.0 

— Range 16 19 21 26 23 23 
mae Median 12.0 14.5 17.0 22.0 28.0 24.0 

ee Range 26 28 32 35 34 24 
Median 5.0 9.0 14.0 17.0 21.0 25.0 

ee ee es 15 33 36 32 28 31 
em Median 9.0 20.0 20.5 26.0 26.5 33.0 

a Range 25 32 42 34 43 28 
Median 9.0 9.5 9.0 10.0 10.5 13.5 

: 
ee Nem 13 14 14 16 18 20 








from the Lombard effect and allied 
phenomena. 

At the conclusion of 40 flashes of 
lights, the relay automatically ceased. 
The tester stopped the tape recorder 
and turned off the earphone switch. A 
switch on the relay bank was thrown 
to a second random-order series for the 
light flashes, and at the end of 15 
seconds the same procedure was re- 
peated at the identical feedback sensa- 
tion level. This provided the subject 
with a brief rest period between the 
40th and 41st number presentations at 
each sensation level. The order in 
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Figure 4. Median error-score functions for 
the five groups. 


which the six sensation levels were pre- 
sented was separately randomized for 
each subject. 

All subjects in the plugged-normal 
group were tested twice at each sensa- 
tion level of delayed speech feedback, 
once with external ear canals plugged 
and once with ears unplugged. The 
order of these test conditions was 


.counter-balanced over the total group. 


The total of the numbers repeated 
correctly by each subject at each sensa- 
tion level was subtracted from 80, the 
best possible score. This difference rep- 
resented the subject’s error score at that 
sensation level. An electric hand coun- 
ter was used to tabulate correct re- 
sponses. 


Results 


Median error scores (Table 2) for 
the five groups at each sensation level 
of delayed speech feedback were deter- 
mined and the results plotted graphi- 
cally (Figure 4+). The curves for the 
hydrops, otosclerotic, and masked- 
normal groups are separated from the 
curves for the normal and plugged- 
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normal groups by a rather wide gap 
starting at the 20-db sensation level. 
The high median error scores obtained 
for the otosclerotic group were totally 
unexpected. 


Discussion 


Viewing these results within the 
framework of the experimental design 
suggests that recruitment does indeed 
accelerate the function relating feed- 
back effect to sensation level. This con- 
clusion seems clear from a consideration 
of the curves for the hydrops and 
masked-normal groups. If the factor of 
lowered speech discrimination scores 
in the hydrops group were solely re- 
sponsible for their performance, a com- 
parable effect would not have been 
expected in the masked-normal group, 
where there was no impairment in 
speech discrimination. The fact that 
both groups behaved in virtually identi- 
cal fashion suggests the factor common 
to both groups, loudness recruitment, 
as the underlying cause. 


The unexpectedly high error scores 
in the otosclerotic group are intriguing, 
yet difficult to explain, since loudness 
recruitment is presumably absent in a 
purely conductive loss. The essentially 
negative results obtained in the plugged- 
normal group fail to support either age 
or the fact of a hearing loss, per se, as 
possible explanations for the curious 
results in the otosclerotics. 


The clinical implications of the find- 
ings reported in this study deserve 
serious consideration. It has been sug- 
gested previously (3) that an individ- 
ual’s spondee threshold can be approxi- 
mated from the lowest intensity level 
at which speech disturbance can be 
measured under exposure to delayed 


speech feedback. The results of the 
present investigation indicate, however, 
that this estimate must necessarily differ 
depending on whether the subject does 
or does not have a hearing loss accom- 
panied by loudness recruitment. Sup- 
pose, for example, that a subject mani- 
fests evidence of the delayed speech 
feedback effect at a hearing level of 
70 db (re normal threshold for spondee 
words). According to the conventional 
yardstick, his true spondee threshold 
would be estimated at no more than 30 
db. The present results indicate, how- 
ever, that his loss could be as great as 
50 or even 60 db if it were accom- 
panied by recruitment, or even if it 
were caused by otosclerosis, for that 
matter. 


Tiffany® interprets the present find- 
ings to mean that, in matters of ad- 
judication, it may be desirable to esti- 
mate the actual loss closer to the feed- 
back response level than the behavior 
of normals would imply. Not to do so 
would, in his opinion, be to err in favor 
of the defendant in a damage suit. 


Summary 


The purpose was to investigate the 
effect of delayed speech feedback at 
sensation levels of 0, 10, 20, 30, 40, and 
50 db re spondee threshold upon five 
groups: (a) a normal-hearing group; 
(b) a labyrinthine-hydrops group (pre- 
sumably with recruitment); (c) a 
masked-normal-hearing group (with 
characteristics analogous to recruit- 
ment); (d) an  otosclerotic group 
(without recruitment); and (e) a 
plugged-normal-hearing group (with- 
out recruitment). There were 10 sub- 





’Personal communication with author, 1959. 
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jects in each group. High median error 
scores in the hydrops and masked- 
normal groups were presumably due to 
the presence of recruitment. High error 
scores in the otosclerotic group were 
apparently due to reasons other than 
the recruitment phenomenon and re- 
main unexplained. 

Some implications of these results for 
the clinical utilization of delayed speech 
feedback as a predictor of the true 
spondee threshold are discussed. 


References 


1. Butter, R. A., and Gatroway, F. T., 
Factoral analysis of the delayed speech 
feedback phenomenon. J. acoust. Soc. 
Amer., 29, 1957, 632-635. 


2. Exuiorr, C. M., Responses of hard of 
hearing subjects to delayed side-tone. 
Master’s thesis, Vanderbilt Univ., 1955. 


3. Hantey, C. N., and Trrrany, W. R., An 
investigation into the use of electro- 
mechanically delayed side tone in audi- 
tory testing. J. Speech Hearing Dis., 19, 
1954, 367-374. 





~~ wf fe A Bie Gene fa. ia!) a ae ei! ee 








Language Skills 


of Profoundly Deaf Adolescent Children 


SIDNEY GODA 


This study investigated the chief lan- 
guage skills of adolescent pupils at a 
residential school for the deaf. Four 
areas were selected for investigation: 
(a) relationship between chronological 
age and language skills, (b) relation- 
ship between intelligence and language 
skills, (c) interrelationships among in- 
dividual language skills, and (d) differ- 
ences between sexes in development of 
language skills. 

This study was conducted at the 
(Flint) Michigan School for the Deaf, 
a state residential school for deaf chil- 
dren. Here training in oral speech is 
emphasized until the child is 12 years 
of age; after that, reading and writing 
receive greater emphasis than oral 
speech. At the time the children reach 
senior high school, they are separated 
into oral and manual groups: the oral 
group composed of children who have 
progressed satisfactorily in using oral 
speech, the manual composed of chil- 
dren who have not progressed satisfac- 
torily in using oral speech. 


Procedure 


Subjects. The subjects selected for 
this study, all enrolled at the Michigan 





Sidney Goda (Ph.D., University of Mich- 
igan, 1957) is Director of Speech and Hear- 
ing Therapy, Johnstone Training and Re- 
search Center, Bordentown, New Jersey. This 
article is based on a doctoral dissertation 
completed under the direction of Dr. Warren 
Ketcham. 


Volume 2, No. 4 


369 


school, were 56 deaf adolescent chil- 
dren who were without any apparent 
secondary handicaps. All children se- 
lected had losses of 75 db or greater in 
each frequency of the speech range, 
512, 1024, and 2048 cps, and equal or 
greater losses at the higher frequencies. 
Each child was either congenitally deaf 
or deafened by the age of two years. 
All had entered school at or before 
the age of six, had received their entire 
education at a residential school for the 
deaf, and were placed in oral classes in 
senior high school. 


The 56 subjects, 32 males and 24 
females, were divided into age groups 
as follows: 10, age 12; 13, age 13; 17, 
age 14; and 16, ages 16 to 18 inclusive. 


Measurement of Skills. Four language 
skills were measured: writing, speak- 
ing, lipreading, and reading. 

Writing. Two samples obtained from 
each child served as a basis for evalu- 
ating written language: (a) a composi- 
tion on “Why I Like Summer Better 
Than Winter’ or “Why I Like Winter 
Better Than Summer”! and (b) 10 sen- 
tences in response to 10 individual pic- 
tures, one sentence response per pic- 
ture. The 10 sentences and composi- 
tions were evaluated as one unit. The 
writing measures were collected from 
groups of six children. Two meetings 





*Two previous studies (1, 8) report on the 
adequacy of these topics with deaf children. 
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were conducted to obtain the samples, 
the first meeting being devoted to the 
compositions and the second meeting 
being devoted to the sentences in re- 
action to the pictures. 


Speaking. Two samples obtained 
from each subject served as a basis for 
evaluating spoken language: (a) 10 sen- 
tences in response to the same 10 pic- 
tures as were used to measure writing 
ability, one sentence response per pic- 
ture, and (b) six sentences to tell a 
story in response to six pictures, one 
sentence response per picture. There 
were thus 16 spoken-sentence responses. 
The samples of spoken language were 
obtained individually. Because the chil- 
dren were responding orally to the 
same pictures to which they had for- 
merly responded in writing, an interval 
of a week was allowed to separate the 
two presentations in order to foster use 
of different vocabularies for the spoken 
and written responses. The child was 
instructed to repeat each sentence 
three times while the experimenter 
wrote down what was said. By being 
able to see the child and the material 
to which he was reacting, the experi- 
menter was usually able to understand 
what the child was saying. In each in- 
stance, however, the subject was shown 
what was written down and asked if 
it was what he had spoken. The child 
was given an opportunity to make cor- 
rections. When the child could not be 
understood, he was instructed to write 
down what he had said. This procedure 
insured maximum understanding of 
every word spoken by the child. 


Lipreading. A four-part test was 
constructed to measure lipreading abil- 
ity: part 1, 10 items, each consisting of 
two words (an article and a noun); 
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part 2, eight items, each consisting of 
a three-word phrase; part 3, five items, 
each consisting of a four-word sen- 
tence; and part 4, six items, each con- 
sisting of a five- or six-word sentence. 
There were thus 29 items in all. The 
test was composed of those words 
which the majority of children had 
used at least once in their written com- 
positions. Thus a child was penalized 
minimally for his inability to spell 
words or to understand the meanings 
of the words. 


The lipreading test, given to groups 
of six children, was administered 10 
different times. In the administration of 
the test each item was spoken in a soft 
voice which the children could not 
hear because of severe hearing losses. 
Each item was spoken three times in 
succession. Then the children were al- 
lowed sufficient time to write the item. 
During the presentation the children 
were seated around a rectangular table, 
three on either side, with the natural 
light from the window at the side of 


* the children and directly over the head 


of the experimenter who was reading 
the test material. Six feet was the maxi- 
mum distance at which any individual 
child was seated from the speaker. 
Children with eyeglasses or visual diffi- 
culties were placed closest to the 
speaker. 


Reading. Reading ability was meas- 
ured by the children’s scores from the 
reading portions of the elementary and 
intermediate batteries of the 1953 re- 
vision of the Stanford Achievement 
Test, administered during the 1953-54 
school year. 


Intelligence. The IQ of each child 
was determined by his performance on 
a nonlanguage intelligence test. All of 
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the children had been tested between 
1952 and 1954 by the school psycholo- 
gist. Two tests were used, the children 
between the ages 12-14 being given 
the Chicago Non-Verbal Test, and the 
older children being given the Revised 
Beta Performance Test. 

Scoring Procedures. Three measures 
were obtained from two writing sam- 
ples, two of a quantitative nature and 
one of a qualitative nature. The quan- 
titative measures, used by numerous 
experimenters (6) in evaluating the 
language ability of hearing children, 
were length of composition and aver- 
age sentence length. The total number 
of words which the child used yielded 
the score for the length of the compo- 
sition. Average sentence length was de- 
termined by dividing the total number 
of words by the number of sentences. 
The qualitative measure, used for the 
development of the sentence, was de- 
termined by use of an 11-point scale 
which the writer constructed. The 
scale ranged from 1, for a completely 
unintelligible response showing no 
concept of a sentence, to J1, for an 
easily intelligible, elaborated sentence 
made up of more than two clauses. 


Each written sentence was evaluated 
separately. The average score of all 
the sentences was the average score 
for development of written sentence. 

The same three measures were used 
to evaluate spoken language ability: 
average length of spoken composition, 
average length of spoken sentence, and 
average score for development of 
spoken sentence. The same method 
which had been used to determine the 
development of the written sentence 
was used also to determine the devel- 
opment of the spoken sentence. 

In the measurement of lipreading 
ability, each word correctly under- 
stood by the child was graded sepa- 
rately. An article received a score of 
one-half point, and any other word re- 
ceived a score of one point. The maxi- 
mum score was 90. 

The reading score was the average 
of the two scores from the reading por- 
tions of the Stanford Achievement 
Test, paragraph meaning and word 
meaning. 

There were thus eight measures: 
three for written language, three for 
spoken language, one for lipreading, 
and one for reading. 


TaBLE 1. Mean language measures by age groups: 12, 13, 14, and 16 to 18 years, with Ns of 10, 13, 
17, and 16, and mean IQ’s of 96.5, 96.3, 91.4, and 103.1, respectively. 











Language Measures Ages 

12 13 14 16-18 

Length of Written Composition in Words 195.6 193.1 182.8 198.1 

Development of Written Sentence 4.5 4.1 3.5 5.1 
(maximum score, 11) 

Length of Written Sentence in Words 7.9 7.6 6.7 8.7 

Length of Spoken Composition in Words 101.0 99.0 83.5 108.2 

Development of Spoken Sentence 4.7 4.2 3.5 4.7 
(maximum score, 11) 

Length of Spoken Sentence in Words 6.0 6.2 5.5 6.8 

Lipreading (maximum score, 90) 37.1 37.5 28.2 53.8 

Reading Grade Level 3.1 2.7 2.6 4.2 
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TaBLe 2. Pearson product moment correlation coefficients for all possible pairs of language measures . 
(With 54 df, a correlation of .26 is significant at the 1% level, .34 at the 5% level.) 








Language Measures 2 





Length of Written Composition .57 
Development of Written Sentence 

Length of Written Sentence 

Length of Spoken Composition 
Development of Spoken Sentence 

Length of Spoken Sentence 

Lipreading 

Reading Grade Level 


ONAAr SOOM 


.63 27 51 .24* 44 49 
-69 59 -70 48 59 71 


45 43 56 
-40 51 
.61 








* Not significant. 


Results 


Reliabilities of the two measures used 
for the expressive language skills of 
writing and speaking were estimated 
by means of the split-half method em- 
ploying the Spearman-Brown formula. 
The reliabilities of the average score 
for development of written sentence 
and average length of written sentence 
were both .90 while the reliabilities of 
the average score for the development 
of spoken sentence and average length 
of spoken sentence were .84 and .92, 
respectively. The reliability for the lip- 
reading test, calculated in the. same 
manner, was .62. Two possible causes 
may account for this relatively low re- 
liability: (a) the type of material 
(words, phrases, and sentences) which 
made up the lipreading test, since pre- 
vious research (2) has demonstrated 
that lipreading scores vary with the 
type of material, and (b) the changing 
conditions (particularly lip move- 
ments and their visibility) during the 
10 administrations of the test. 


Chronological Age and Language 
Skills, The means for the eight language 
measures (Table 1) show a general 


decrement from the group of 12-year- 
old children through the next two older 
groups. However, the oldest group (16 
to 18 years) generally scored higher on 
the language measures. 

In order to test the significance of 
the differences in performance among 
three of the four age groups (12-year, 
13-year, and 14-year), partial correla- 
tions were figured between chronologi- 
cal age and the eight language measures 
with IQ constant.? Two negative cor- 
relations, average score for development 
of spoken sentence and total amount of 
speaking, are significant for these three 
age groups. For the entire group (all 
four age levels), however, five corre- 
lations between language measures and 
chronological age are positive and sig- 
nificant. The three measures which did 
not show positive significant correla- 
tions with age for the total group were 
the ones for the amount of writing, the 
development of spoken sentence, and 
the amount of speaking, these latter 
two being the measures of oral per- 





*This score was held constant because the 
correlation coefficient between chronological 
age and IQ for these three groups was —.30, 
which is significant at the .10 level. 
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TaBLE 3. Factor loadings of eight language-measure variables on first two factors extracted in centroid 


factor analysis. 











Tests Factor A* Factor BY Communality (h?) 
Length of Written Composition 63 41 .5650 
Development of Written Sentence .84 .10 -7681 
Length of Written Sentence .80 .25 .6500 
Length of Spoken Composition .80 -.53 .9209 
Development of Spoken Sentence ay .05 . 5954 
Length of Spoken Sentence .73 -. 56 .8465 
Lipreading . 66 14 -4552 
Reading Grade Level .78 14 .6280 








* General language factor. 
t Language confidence factor. 


formance which showed significant de- 
creases from the 12-year, to the 13- 
year, to the 14-year age group. 

Intelligence and Language Skills. 
Partial correlations for the three age 
groups (12-year, 13-year, and 14-year) 
with chronological age constant re- 
vealed five measures significantly re- 
lated to intelligence: all three measures 
of written language, average score for 
development of spoken sentence, and 
average reading grade level as deter- 
mined by scores on the reading por- 
tions of the Stanford Achievement 
Test. The reading score is the only 
measure related significantly to intelli- 
gence with the 16-to-18-year age group. 

Interrelationships of the Language 
Skills. Pearson product-moment corre- 
lation coefficients were computed over 
the entire age range for all possible 
pairs of tests in order to estimate the 
degree of interrelationship of the lan- 
guage skills. The intercorrelations are 
presented in Table 2. Twenty-seven 
out of the 28 correlations are positive 
and significant. 

Underlying Abilities. A factor analy- 
sis was carried out to procure infor- 
mation concerning the underlying 


abilities accounting for the observed 
significant correlations among the eight 
tests. The factoring was done by 
Thurstone’s centroid method to two 
factors, and the resulting matrix is 
shown in Table 3. After the removal of 
these two factors, no further factors 
were extracted because of the small size 
of the residuals. Their mean absolute 
value was .04, and only one was above 
.10. The communalities listed in Table 
3 indicate the amount of variance which 
the two factors contribute to the tests. 
With the exception of the lipreading 
test, these two factors account for over 
one-half of the variance of each test. 

Sex Differences in Language Skills. 
Differences between the sexes* were 
determined for each of the eight lan- 
guage measures. The differences all 
favored females, but none of these dif- 
ferences was statistically significant as 
determined by Fisher’s t test. The meas- 
ure which showed the largest sex dif- 
ference was the one for the develop- 
ment of the spoken sentence, significant 
beyond the 10% level. 





*No significant differences were found be- 
tween the sexes in intelligence or chrono- 
logical age. 
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Discussion 


The inconsistent patternings of lan- 
guage skills found with the four age 
groups in this study (see Table 1) are 
similar to those found in several other 
studies. Pintner (7) comments on the 
erratic pattern of growth in the lan- 
guage of the deaf child. Lyon and 
others (4) found uneven growth in 
reading, with the largest improvement 
occurring above grade nine where the 
average age was 18. The results of two 
other studies show an uneven pattern- 
ing in written language. Heider and 
Heider (1) found decreases in sentence 
length between 13- and 14-year-old 
children and between 15- and 16-year- 
old children; Weils (8) found seven 
instances in intermediate grades where 
the language of older deaf children was 
inferior to the language of younger 
deaf children. 


The kind of instructional program 
presented in the particular school used 
in the present study may account for 
(a) the significance of decreases in 
oral language measures from the 12- 
year to the 13-year to the 14-year age 
group, (b) the nonsignificance of in- 
creases in oral language measures from 
the youngest to the oldest group, and 
(c) the significance of increases in the 
writing and reading measures from the 
youngest to the oldest group. In the 
curriculum under discussion, oral lan- 
guage is stressed until the age of 12. 
After this age less stress is placed on 
oral language. The child at age 12, then, 
may have reached a high point in his 
ability to use oral speech. Increased 
emphasis on reading and writing after 
age 12 appears to parallel a greater pro- 
ficiency in these language skills than 
in oral language, which is relatively 


neglected in the educational program 
of the adolescent children. 

Intelligence was found to be more 
closely related to the language meas- 
ures for the 12-year, 13-year, and 14- 
year age groups than for the 16-to-18- 
year age group. Therefore intelligence 
would seem to play a more significant 
role in language development for the 
younger adolescent deaf child than for 
the older adolescent deaf child. 

The intercorrelations of the language 
measures as shown in Table 2 indicate 
a high consistency of interrelatedness. 
Twenty-seven out of 28 correlations 
are positive and significant. There is 
apparently a tendency for the deaf 
child who is superior in one language 
skill to be relatively superior in all 
language skills and for the deaf child 
who is inferior in one language skill to 
be relatively inferior in all language 
skills. Thus the child who excels in 
reading may be expected, in general, to 
lipread, write, and speak at a relatively 
high level while the child who reads 
at a lower level may be expected, in 
general, to lipread, write, and speak at 
a relatively low level. 

Further, within the two expressive 
skills of writing and speaking, there 
was found to be a significant relation- 
ship between the quantity of expression 
and the quality of expression. The child 
who uses a relatively greater number of 
words in his written or oral language 
may be expected to express himself 
with relatively longer and more com- 
plex sentences. The average length of 
spoken sentence is, however, not related 
significantly to the average length of 
written composition. 

The two factors which were ex- 
tracted in the factor analysis account 
for over one-half the variance of each 
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test with the exception of the lipread- 
ing test. The Heiders (2) have stated 
that individual factors are responsible 
for superiority in lipreading. Therefore 
this language skill may not be affected 
by any general ability to the same ex- 
tent as are the other language skills. 

Factor A, general language factor, 
was found in each of the eight tests, 
suggesting that a single factor under- 
lies all of the language measures. Ket- 
cham (3), Williams (9), and Yedinack 
(10) suggest that a general language 
factor underlies the individual language 
skills for hearing children. Olson (6) 
implies that in language development 
there is an underlying unity which is 
expressed in the various structures and 
functions that are measured. 

Factor B, language confidence fac- 
tor, was found to exist in three of the 
tests: amount of writing, amount of 
speaking, and average length of spoken 
sentence. This factor was present to a 
lesser extent in the average length of 
the written sentence. Factor B was 
positively weighted in the two written 
tests but negatively weighted in the 
two oral tests. Negative loading paral- 
lels an inhibition of speaking but a 
positive loading is coincident with a 
greater quantitv of writing. 

An analysis of the school environ- 
ment and test situation may account for 
the presence of Factor B. The older 
children generally use sign language, 
relying very little on oral speech for 
purposes of communicating their 
thoughts. Therefore the oral speaking 
test may have been fear provoking to 
the subjects since it demanded a form 
of response which was somewhat for- 
eign to them and one which they did 
not feel adequate in handling. A fur- 
ther element in creating fear was the 


presence of the experimenter, who was 
not only a normal hearing person and 
stranger but also one who could not 
communicate with or understand sign 
language. Since this test was conducted 
on an individual basis, the child did 
not receive any outside support from 
others in his school environment, which 
would have served to minimize the 
fear. Being fearful of speaking and lack- 
ing confidence in their ability to speak, 
the children spoke as little as possible, 
with as short sentences as possible. 


The written tests, however, required 
a response for which the children felt 
adequate. They had practiced writing 
in the classrooms and were confident 
of their ability to express themselves 
with written language. Further, groups 
of children were present and responded 
at the same time. Any fear produced in 
an individual child by the presence of 
the experimenter was probably allayed 
to a great extent by the presence of 
other children from the same school 
environment. Because of a feeling of 
confidence in their ability to express 
themselves with written language, the 
children wrote relatively long compo- 
sitions with relatively long sentences. 


The differences in language measures 
all favor the female, but these differ- 
ences lack statistical significance. These 
findings are comparable to results found 
with normal hearing children as sum- 
marized by McCarthy (5, pp. 492-630). 


Summary 


The chief language skills of 56 pro- 
foundly deaf adolescent children at a 
residential school were studied. The 
56 subjects, 32 males and 24 females, 
were divided into four age groups. 
Four language skills were measured: 
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writing, speaking, lipreading, and read- 
ing. 

The following conclusions are drawn: 
(a) because of the consistent interrela- 
tionships of the language skills, the pre- 
diction may be made that the deaf child 
who is superior in one will generally 
be superior in all of the skills, and the 
deaf child who is inferior in one will 
generally be inferior in all of the skills, 
and (b) the quantitative and qualitative 
aspects of expressive language appear to 
be related; thus the child who uses a 
relatively larger number of words in 
his speaking, and writing will generally 
express himself with relatively longer 
and more complex sentences. 
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Language Skills of 


Male and Female Kindergarten Children 


HARRIS WINITZ 


Previous language investigations have 
generally shown girls to be superior to 
boys on several language skill: (4, 5, 6, 
7, 11, 20, 25). The differences, however, 
have been small and have rarely been 
statistically significant. In addition, the 
differences favoring girls have not been 
consistent with regard to the age level 
or language skill investigated. For ex- 
ample, Templin (20) reported a statis- 
tically nonsignificant difference favor- 
ing boys on all language skills investi- 
gated at age three (the first age level of 
her study), but McCarthy (J1) found 
girls to be superior to boys on these 
skills at age three. With regard to ar- 
ticulation skills Davis (5) and Templin 
(18) found that girls achieved mastery 
of articulation earlier than boys, but 
Roe and Milisen (14) reported no statis- 
tically significant differences between 
boys and girls on the number of er- 
roneously uttered sounds at comparable 
age levels (grades 1 through 6). 


A summary of findings regarding sex 
differences on each of six commonly 
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employed language measures reported 
in the literature ( 2, 4, 6, 7, 11, 14, 15, 
17, 20) is presented in Table 1. These 
differences favor girls almost two 
to one, but the differences are statisti- 
cally significant in only a few instances. 


In many of these investigations, how- 
ever, both intelligence and socioeco- 
nomic status, two variables which are 
known to be related to the linguistic 
development of children, have not al- 
ways been equivalent for male and 
female children. In McCarthy’s study 
(11) the girls were more intelligent and 
were selected from higher socioeco- 
nomic levels than the boys. Fisher (7) 
reported IQ scores for the girls higher 
than those for the boys. In some in- 
stances Templin’s (20) 1Q-equivalents 
are higher for the girls than for the 
boys. 

Population differences between sexes 
either in intelligence as measured by IQ 
scores or in socioeconomic status should 
presumably not exist because (a) in- 
telligence tests have been designed (16, 
22, 24) so as not to discriminate be- 
tween sexes, and (b) it does not seem 
reasonable to assume that more girls 
than boys are found in higher socioeco- 
nomic classes. 


If real differences in language devel- 
opment exist between sexes, it becomes 
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TaBLe 1. Summary of findings regarding sex differences on each of six measures reported in the litera- 
ture (2, 4, 6, 7, 11, 14, 15, 17, 20) on language skills of children. Entries under Age Levels indicate 
the total number of tests; the remaining entries show the distribution of results. 











Measure Age Levels Superior Significantly Superior 

Boys Girls Neither Boys Girls 

Mean Length of Response 39 13 22 4 0 1 
Variability of Response 8 3 5 0 0 0 
Number of Different Words 15 5 10 0 1 0 
Structural Complexity Score 8 6 0 0 1 
Articulation 36 14 21 i 0 3 
Vocabulary 18 ff itil 0 0 1 
Total 124 44 75 5 1 6 








important to understand those environ- 
mental, physiological, and mental fac- 
tors which differentially affect the sex- 
es. A knowledge of such factors may 
lead to a greater understanding of the 
processes of language development and 
retardation. However, before attempts 
are made to explain sex differences, it 
appears necessary to establish, using 
large samples of subjects, whether or 
not significant sex differences can be 
demonstrated. 

The present investigation was planned 
in an effort to answer the following 
question: are the reported relatively 
small language differences which favor 
females real differences which would 
be significant with large samples of sub- 
jects or are they chance differences? 


Procedure 


Selection of Subjects. The subjects 
“were selected from children who were 
either living in Iowa City homes or re- 
siding in towns immediately surround- 
ing Iowa City and who were expected 
to enter the kindergarten classes of the 
Iowa City schools in September, 1958. 
Seventy-five girls and 75 boys were 
selected randomly from lists of enter- 


ing children made available through the 
office of the Iowa City Superintendent 
of Schools. 


Sampling Procedure. The proportion 
of kindergarten children scheduled to 
enroll at each school, out of the total 
number of children expected to enter 
all Iowa City kindergarten classes, was 
determined, the appropriate proportion 
of the total sample of 150 was then 
selected from the kindergarten enrollees 
of each school. An equal number of 
boys and girls was selected from each 
school, exclusive of children of Uni- 
versity of Iowa students, rural children, 
and twins. Children of University of 
Iowa students living in temporary mar- 
ried-student housing were excluded 
because it was difficult to classify these 
families on a socioeconomic scale and 
it was felt that adequate testing fa- 
cilities would not be available in their 
homes. Rural children were eliminated 
because difficulties encountered in ob- 
taining the cooperation of these families 
did not warrant their inclusion. Twins 
were eliminated because a dispropor- 
tionate number of twins in either sex 
group might substantially affect scores 
obtained on certain language measures. 
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Subjects were restricted to physically 
normal white children who were from 
monolingual homes, were considered to 
be normal in intelligence (that is, with 
Full Scale IQ’s of 70 or above on the 
Wechsler Intelligence Scale for Chil- 
dren), were currently considered by 
their parents not to stutter, and were 
found to have normal hearing. The 
children’s hearing was tested by a port- 
able Microtone ADC audiometer at 
4000 cycles at an intensity level of 20 
db. This screening procedure, as re- 
ported by House (9) and House and 
Glorig (10), accomplishes essentially 
the same purpose as any presently used 
auditory screening method. 


‘Nonlanguage Measures. Information 
was obtained upon which a comparison 
of the boys and girls could be based, 
as indicated below: 

Chronological Age: The CA of each 
child was determined to the nearest 
month, 

Intelligence: Each child was ad- 
ministered the Wechsler Intelligence 
Scale for Children (WISC) (24). From 
this test three IQ scores were obtained: 
Verbal Scale IQ, Performance Scale 
IQ, and Full Scale IQ. 

Socioeconomic Status: The socioeco- 
nonic class of the child’s family was 
estimated through the use of the Index 
of Status Characteristics (23). 

Family Constellation: The family 
constellation of each child was clas- 
sified into one of the following five 
categories: (a) only child, a child hav- 
ing no older or younger sibling within 
six years of his age; (b) singleton with 
older siblings, a child having one or 
more older siblings within six years of 
his age but having no younger sibling 
within six years of his age; (c) single- 
ton with younger siblings, a child hav- 
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ing one or more younger siblings with- 
in six years of his age but having no 
older sibling within six years of his 
age; (d) singleton with older and 
younger siblings, a child having at least 
one older and one younger sibling 
within six years of his age. 


Language Measures. The following 
language measures were administered to 
all subjects: 

Verbalizations:* Fifty verbal respon- 
ses were elicited in the following man- 
ner: the examiner presented one Chil- 
dren’s Apperception Test (3) card at a 
time and engaged the child in conversa- 
tion with neutral comments such as 
“Tell me about this picture.’ Analyses 
were made of mean length of response, 
number of one-word responses, mean 
number of words in the five longest 
responses, mean standard deviation, 
number of different words, and struc- 
tural complexity. Methods of scoring 
were similar to those used by McCar- 
thy (11), Davis (4), and Templin 
(20). The procedure suggested by 
Templin (20, pp. 81-82) for quantify- 
ing structural complexity was followed. 

Articulation Measures: The Templin 
Screening Test of Articulation (19) 
was administered. An attempt was 
made to elicit each sound spontaneously 
in response to a pictorial stimulus. 
When this procedure failed, the child 
was asked to repeat the word after 
the examiner. These two methods have 
been found by Templin (78) to give 
essentially the same results. Each item 
was scored as correct or incorrect. 

Vocabulary: The Ammons Full- 
Range Picture Vocabularly Test, Form 
A (1), was administered. 

*See Appendix for procedures of elicitation 


and scoring not specified by previous investi- 
gators. 
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TaBLe 2. Means, standard deviations, and ranges of nonlanguage measures for boys, girls, and sexes 
combined. Values in parentheses are normative means and standard deviations from Manual for 


Wechsler Intelligence Scale for Children. 











Measure Boys Girls Sexes Combined 

Mean’ SD. Mean SD. Mean SD. Range 

CA (months) 63.76 2.65 63.24 2.83 63.50 2.65 60-72 

Socioeconomic Status 42.07 12.57 42.92 11.53 42.49 12.08 21-76 

WISC Verbal Scale IQ 100.05 18.71 100.95 9.75 100.50 11.91 69-142 
(99.5) (13.8) (99.8) (13.4) 

WISC Performance Scale IQ 98.23 12.49 102.76 10.34 100.49 11.66 72-138 
(98.6) (16.9) (101.0) (18.9) 

WISC Full Scale IQ 99.07 12.85 101.95 9.22 100.51 11.27 72-183 
(99.0) (15.4) (100.4) (18.2) 








Word Fluency Measures: Four meas- 
ures of word fluency were obtained: 
rimes, child names, adult names, and 
thing names. Procedures for adminis- 
tration of these items were adapted 
from Gewirtz (8). The child was given 
one minute for each of the following 
six tasks: (a) name as many words as 
possible that rime with the ‘sounds’ 
-ail, -ake, and -ing; (b) name as many 
children, adults, and things as possible. 


Order and Place of Test Administra- 
tion. The tests were administered to 
each child in two one-hour sessions, as 
follows: in the first session the Per- 
formance and Verbal Scales of the 
WISC were administered and the word- 
fluency measures were obtained. In the 
second session the articulation and vo- 
cabulary tests were administered and 
50 verbal responses were elicited and 
recorded. The two testing sessions were 
completed within three days. Only the 
examiner, who administered all tests, 
was present with each subject. Except 
in the case of two boys, all tests were 
conducted in the child’s home, usually 
in his bedroom or in a playroom. 


Examiner Reliability. The reliability 
of the examiner was determined in the 
following manner: 

Observer Reliability. The Templin 
Screening Test was administered to six 
kindergarten children, their responses 
being recorded by the author and one 
other trained examiner. Percentages of 
observer agreement were calculated for 
each sound for each child. The average 
percentage of agreement was 93%. 

In order to determine how well two 
observers agree, using the McCarthy- 
Davis procedure, in identifying a spo- 
ken response, the author and one other 


Taste 3. Summary of ¢ tests for evaluation of 


differences between boys age girls on severa 
nonlanguage measures (df = 148). 











Mean 
Measure Dif- t Sign. 
ference Level 
CA .52 1.14 .30 
Socioeconomic Status .85 .43 .70 


WISC Verbal Scale IQ .90 .55 .70 


WISC Performance 
Scale IQ 


WISC Full Scale IQ 


4.53 
2.89 


2.41 .02 
1.57 20 
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Taste 4. Chi square testing independence of 
frequencies of family constellation types for boys 
and girls. 








Family Constellation 





Type Boys Girls Total 
Only Children* Bre 6 
Only Older Siblings 28 20 48 
Only Younger Siblings 16 22 38 
Older and Younger 
Siblings 26 27 53 
Total 70 69 139 


Chi square (df = 2) = 2.29; P = .50 








* Frequencies of only children were excluded in 
the chi-square computation. 


observer recorded on paper the first 
and last word of 15 remarks from the 
same six subjects. Percentage of agree- 
ment was determined by comparing 
the first and last words of the 15 re- 
sponses recorded by the two observers. 
The average percentage of agreement 
was 94%. In the investigation proper 
the author identified each response in 
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the same way and in addition each sub- 
ject’s responses were tape recorded so 
that a typed script of the complete set 
of remarks could be prepared. 

Scorer Reliability. The self-agreement 
of the examiner was found for length 
of response and structural complexity 
score. From the typed protocols a ran- 
dom selection of the records of 10 sub- 
jects (a total of 500 responses) was 
made and each response was rescored. 
The obtained percentages of agreement 
were 99% and 96% for the rescoring 
of length of response and structural 
complexity score, respectively. The re- 
liability coefficients in general indicate 
high examiner agreement. 


Results 

Nonlanguage Measures. The means, 
standard deviations, and ranges for 
chronological age, socioeconomic sta- 
tus, and IQ are presented in Table 2. 
The results of the t tests of significance 
of sex differences are given in Table 3. 
The only nonlanguage difference that 


TABLE 5. Means, standard deviations, and ranges for verbalization measures, Templin Screening Test 
of Articulation, Ammons Full-Range Picture Vocabulary Test, and word-fluency measures for boys, 


girls, and sexes combined. 











Measure Boys Girls Sexes Combined 
Mean _ SD. Mean’ SD. Mean SD. Range 
Length of Response 5.16 1.52 5.61 1.76 5.39 1.66 1.7-13.8 
Number of One-Word Responses 8.33 5.39 7.40 3.87 7.87 4.80 1-34 
Mean of Five Longest Responses 13.44 5.69 16.40 7.85 14.92 7.01 4.4-52.8 
Mean §8.D. 3.60 1.29 4.11 1.58 3.85 1.46 1.2-16.2 
Number of Different Words 102.24 25.44 110.29 24.98 106.27 25.53 43-181 
Structural Complexity Score 38.43 14.56 42.37 13.96 40.40 14.39 1-73 
Templin Test 40.47 10.15 41.69 9.33 41.08 9.75 7-50 
Ammons Test 25.19 5.39 25.04 4.24 25.11 4.90 14-38 
Rimes 2.69 3.16 2.41 3.00 2.55 3.16 0-12 
Child Names 7.75 3.00 9.48 4.00 8.61 3.61 2-21 
Adult Names 3.36 3.00 4.23 3.16 3.79 3.16 0-14 
Thing Names 12.04 5.83 11.93 5.00 11.99 5.39 0-26 
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TaBLeE 6. Summary of ¢ tests for evaluating dif- 
ferences between boys and girls on the several 
language measures (df = 148). 











Mean 
Measure Dif- t Sign. 
ference Level 
Length of Response 45 1.64 10 
Number of One-Word 
Responses .93 1.21 .30 
Mean of Five Long- 
est Responses 2.96 2.63 .O1 
Mean §.D. 51 2.17 .05 
Number of Different 
Words 8.05 1.94 10 
Structural Complexity 
Score 3.94 1.68 .10 
Templin Test 1.22 sae: .50 
Ammons Test 14 .02 .90 
Rimes .28 .55 .70 
Child Names 1.73 2.98 .O1 
Adult Names .87 Mg 10 
Thing Names stl .12 -90 








reaches significance at the .05 level is 
on the WISC Performance Scale IQ. 
However, the WISC Full Scale IQ 
scores for the two groups were nearly 
equivalent and the groups may be con- 
sidered essentially the same in intellec- 
tual ability. 

The frequencies of family constella- 
tion types for boys and girls are sum- 
marized in Table 4. The chi square of 
2.29 was not significant, indicating that 
the distribution of children in each of 
the family constellation categories em- 
ployed in this test was independent of 
sex. 

It appears, then, that the groups of 
boys and girls in this study were essen- 
tially equivalent with regard to the non- 
language variables shown in previous 
language investigations to be impor- 
tantly related to the linguistic achieve- 
ment of children. 

Language Measures. The means, 
standard deviations, and ranges of the 


language measures are presented in 
Table 5. The results of the ¢ tests are 
given in Table 6. The .05 level of con- 
fidence was established as the minimum 
probability level required for detecting 
statistical significance. 

Verbalizations. As shown in Table 6, 
the direction of the differences for. all 
six verbalization measures is in favor 
of the girls. However, the differences 
are significant in the case of only two 
measures, mean of the five longest re- 
sponses and mean standard deviation. 

Templin Screening Test of Articula- 
tion. The difference in articulation 
scores favors the girls by a little more 
than one sound. This difference, how- 
ever, is not significant. 

Ammons Vocabulary Test. The dif- 
ference in vocabulary scores favors the 
boys, but the difference is small and 
nonsignificant. 

Word-Fluency Measures. For two 
measures, Rimes and Thing Names, the 
differences favor the males and for two 
measures, Child Names and Adult 
Names, the differences favor the fe- 
males. Only the difference for Child 
Names is significant. 


Discussion 


Verbalization Measures. A more ad- 
vanced development of girls over boys 
at five years of age on the verbalization 
scales is not demonstrated in this study. 
The obtained values indicate significant 
differences on two of three variability 
measures but do not indicate significant 
differences on the verbalization meas- 
ures which are generally regarded as of 
major importance: length of response, 
structural complexity score, and num- 
ber of different words. These statisti- 
cally nonsignificant results indicate con- 
siderable overlap between the two 
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groups on these measures. It would ap- 
pear that on the basis of the statistical 
tests employed, the hypothesis of no 
difference between sexes on the major 
verbalization measures is tenable. Fur- 
thermore, the values presented in Table 
5 indicate that the differences between 
the sexes are not of sufficient magnitude 
to justify regarding the groups as essen- 
tially different in verbalization skills, 
thus contraindicating (a) investigations 
attempting to explain sex differences 
and (b) the clinical practice of consid- 
ering the verbalization skills of the two 
groups as different (as suggested by the 
use of separate norms, for example). 

It may be argued, however, that since 
the ¢ values in all instances approached 
or reached significance, there may be 
slight differences favoring the girls on 
all verbalization measures but that these 
differences would be detectable only 
with a sample of subjects considerably 
larger than that employed in this in- 
vestigation. Both the cost of conduct- 
ing such an expanded investigation and 
the possible limited usefulness of the 
results make the practicability of such 
an investigation questionable. 

The possibility may be suggested 
that one of the reasons for not obtain- 
ing statistically significant differences 
in favor of the girls was that the exam- 
iner was a male. However, three of the 
10 persons who administered tests in 
the Templin investigation were male, 
and one of the men did a substantial 
amount of the testing.? Since children 
in the present culture typically interact 
with members of both sexes, it would 
seem that the sex of the examiner is not 
an important source of error. 


*Personal communication from Dr. Templin, 
1959. 
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In this study the only values which 
compare closely with those of Tem- 
plin (20) pertain to length of response. 
Noticeable discrepancies appear be- 
tween the mean values of the other ver- 
balization measures in this study and 
the corresponding values in Templin’s 
study. These differences arose in all 
probability from the use of different 
stimulus materials and from procedural 
differences, the most important of 
which was probably the use of a tape 
recorder. In previous studies the accu- 
racy of transcribing responses rested 
entirely with the listener, who had to 
make rapid judgments concerning fleet- 
ing verbalizations. This investigator 
found in typing the tape-recorded re- 
sponses that it was usually necessary 
to replay the responses several times 
in order to be absolutely certain of 
what the child had said. In many in- 
stances the examiner felt that he had 
heard a verb, article, or conjunction, 
but upon replaying he often found that 
the child had omitted these words. This 
reason can account for the fact that 
the structural complexity scores are 
lower than those reported by Templin. 


Templin Test of Articulation. On 
the basis of the findings, the hypothesis 
of no difference between sexes on the 
Templin Test appears tenable. Differ- 
ences favoring females have not been 
observed as frequently in articulatory 
skills as in verbalization measures. Roe 
and Milisen (14) and Sayler (15), who 
studied approximately 2000 children 
from grades one through 12, found no 
statistically significant differences be- 
tween boys and girls with respect to 
the correct production of consonant 
sounds. 

It may be hypothesized that there are 
no real differences between boys and 





girls of five years of age and older with 
respect to articulatory skills and that 
the differences previously emphasized 
may be an overflow generalization from 
the verbalization measures. However, 
the results of this study and the studies 
of Roe and Milisen and Sayler do not 
permit a statement regarding the com- 
parative articulatory skills of boys and 
girls below five years of age. 


Ammons Vocabulary Test. The dif- 
ference between sexes in vocabulary 
skills observed in this study, as meas- 
ured by the Ammons Test, was negli- 
gible and statistically nonsignificant. 
Girls have been generally ‘regarded 
(12) as being more proficient i in vocab- 
ulary skills than boys. This opinion may 
have been fostered by the early Smith 
(17) investigation. However, most 
often the makers of vocabulary tests 
have attempted to create tests that are 
equivalent for both sexes. Therefore, 
it is difficult to say, on the basis of 
standardized vocabulary tests, whether 
or not a sex difference exists. If one 
were to investigate further sex differ- 
ences with regard to vocabulary acqui- 
sition, a logical starting point might be 
the readministration of the Smith Test 
to a large sample of boys and girls. 

The scores for both the girls and the 
boys in this study were found to ex- 
ceed by approximately four words 
the scores reported by Ammons and 
Holmes (2). As the examiner checked 
all responses that appeared to be guesses, 
the difference between the scores in 
this sample and those in the Ammons 
and Holmes standardization sample may 
reflect a true advancement in children’s 
vocabulary skills since the standardiza- 
tion of the test and its administration 
by Templin approximately 10 years 
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ago. This possibility is, indeed, not sur- 
prising. The Revised Stanford-Binet, 
Form L, is now reported to be yielding 
higher scores than were reported in the 
standardization norms (13). 


Word-Fluency Measures. On the 
basis of the results obtained in this 
study it seems reasonable to conclude 
that girls do not possess a general supe- 
riority in word fluency. They do ap- 
pear to be distinctly superior in naming 
children, Female superiority in naming 
children may be the result of the tend- 
ency for girls to be more sociable than 
boys (22). However, this superiority 
was not demonstrated in the naming 
of adults. 


Summary 

This investigation sought to discover 
whether previously reported and rela- 
tively small differences favoring female 
children are chance differences or real 
differences which would be significant 
with large samples. Subjects were 150 
randomly selected, normal, five-year- 
old children (75 boys, 75 girls) from a 
single community. The two sex groups 
were essentially the same with regard 
to chronological age, IQ, socioeconomic 
status, and family constellation, On lan- 
guage variables, statistically significant 
differences favoring the girls were 
found on two of six verbalization meas- 
ures and on one of four word-fluency 
measures. No significant differences 
were found on the Templin Articula- 
tion Test and the Ammons Vocabulary 
Test. It was concluded that the hy- 
pothesis of no language difference be- 
tween the sexes is tenable in the popu- 
lation of five-year-old children with 
regard to major verbalization measures, 
articulatory skills, vocabulary skills, and 
three of four word-fluency measures, 

















Language Skills of Kindergarten Children: Winitz 385 


References 


1. 


™ 


13, 


14. 


Ammons, R. B., and Ammons, HELEN 
S., The Full-range Picture Vocabulary 
Test. Louisville, Ky.: Psychological Test 
Specialists, 1948. 


. Ammons, R. B., and Hotes, J. C., The 


full-range picture vocabulary test: III. 
Results for a preschool-age population. 
Child Develpm., 20, 1949, 5-14. 


. Betziax, L., and Betiak, Sonya S., Chil- 


dren’s Apperception Test (manual, rev. 
ed.). New York: C.P.S. Co., 1950. 
Davis, Epirra A., The development of 
linguistic skills in twins, singletons with 
siblings, and only children from age five 
to ten years. Inst. Child Welf., Monogr. 
Ser., No. 14. Minneapolis: Univ. Minn. 
Press, 1937. 


. Davis, Irene P., The speech aspects of 


reading readiness. In Chap. 2 in Newer 
Practices in Reading in the Elementary 
School (17th yearb., dept. elem. school 
prin., NEA), vol. 17, no. 7, 1938. 


. Day, Extra J., The development of Jan- 


guage in twins: I. A comparison of twins 
and single children. Child Develpm., 3, 
1932, 179-199. 


., Fisner, Mary S., Language patterns of 


preschool children. Child Develpm. 
Monogr. (Columbia Univ.) No. 15, 1934. 


. Gewirtz, J. L., Studies in word-fluency: 


I. Its relation to vocabulary and mental 
age in young children. J. genet. Psychol., 
72, 1948, 165-176. 


. House, H. P., A new instrument and con- 


cept for the rapid detection of a possible 
hearing impairment. Trans. Amer. Acad. 
Opthal. Oto-laryng., 61, 1957, 228-230. 


. House, H. P., and Groric, A., A new 


concept of auditory screening. Laryngo- 
scope, 67, 1957, 661-668. 


. McCartuy, Dorotnea A., The language 


development of the preschool child. 
Inst. Child Welf., Monogr. Ser., No. 4. 
Minneapolis: Univ. Minn. Press, 1930. 


. McCartuy, Dorornea A., Language de- 


velopment in children. Chap. 9 in L. 
Carmichael (Ed.) Manual of Child Psy- 
chology (2nd ed.). New York: John 
Wiley, 1954. 

Pastovic, J., and Guturre, G., Some evi- 
dence on the validity of the WISC. J. 
cons. Psychol., 15, 1951, 385-386. 

Rog, Vivian, and Miuisen, R., The effect 
of maturation upon defective articulation 
in elementary grades. J. Speech Dis., 7, 
1942, 37-45, 


15. 


16. 


i. 


19. 


20. 


4 e 


yr 


23. 


24. 


25. 


Sayuer, Heten K., The effect of matura- 
tion upon defective articulation in grades 
seven through twelve. J. Speech Hearing 
Dis., 14, 1949, 202-207. 

SeasHore, H., WesMan, A., and Doppett, 
J., The standardization of the Wechsler 
Intelligence Scale for Children. J. coms. 
Psychol., 14, 1950, 99-110. 

SmitH, Maporau E., An investigation of 
the development of the sentence and the 
extent of vocabulary in young children. 
Univ. Ia Stud. Child Welf., 3, No. 5, 1926. 


. Temp.in, Mirprep C., Spontaneous versus 


imitated verbalization in testing articula- 
tion in preschool children. J. Speech 
Hearing Dis., 12, 1947, 293-300. 
Tempun, Mitprep C., Norms on a 
screening test of articulation for ages 
three through eight. J. Speech Hearing 
Dis. 18, 1953, 323-331. 

Tempun, Mirprep C., Certain language 
skills in children, their development and 
interrelationships. Inst. Child Welf., 
Monogr. Ser. No. 26. Minneapolis: Univ. 
Minn. Press, 1957. 

Terman, L. M., and Merritt, Maun A., 
Measuring Intelligence. Boston: Hough- 
ton Mifflin, 1937. 

Terman, L. M., and Tyzer, Leona E., 
Psychological sex differences. Chap. 16 
in L. Carmichael (Ed.), Manual of Child 
Psychology (2nd ed.). New York: John 
Wiley, 1954. 

Warner, W. L., Meeker, Marcuia, and 
Eetts, K., Social Class in America. Chi- 
cago: Science Research Assoc., 1949. 
Wecuster, D., Manual for Wechsler In- 
telligence Scale for Children. New York: 
Psychological Corp., 1949. 

WEeE.iMan, BetH L., Case, Ina M., MEn- 
GERT, Ipa G., and Brapsury, Dorotuy E., 
Speech sounds of young children. Univ. 
la Stud. Child Welf., Vol. 5; No. 2, 1931. 


Appendix 


Procedures of elicitation and scoring not 


included by McCarthy (11), Davis (4), or 
Templin (20) are indicated below: 


Recording the Remarks: The following 


definition was added: (1) Remarks connected 
by interjections and conjunctions, such as 
‘and, ‘um, ‘er, ‘uh,’ etc., are considered as 
separate remarks if the remarks appear to be 
clearly enumerative. Examples: “The girl . : . 


er. 


‘The boy . um. ‘And the monkey.’ 


Each example was considered a single remark. 
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Typing the Remarks: The following pro- 
cedures were followed in typing the remarks 
from the tape recordings: 

A. Repetitions. Since the repetition of 
words and phrases was not considered in- 
dicative of linguistic maturity and since it 
would substantially add to the length of the 
uttered response, it was decided to exclude 
repetitions according to the following rules: 


1. Word repetitions. Repetition of the 
same word in a sentence was typed only 
once: ‘He, he, he went home. “The boy 
ran ran he ran away.’ The italicized words 
were excluded. 

2. Phrase repetition. Repetition of the 
same phrase in a sentence was typed only 
once: ‘He was, be was hurt. ‘And he hit, 
and he hit him, The italicized words were 
excluded. However, the insertion of at 
least one different word nullified a phrase 
repetition: ‘And he hit, and he spanked 
him.’ “They were going, they were having 
fun.’ Word repetition within a phrase rep- 
etition did not nullify a phrase repetition: 
‘And he’s, he’s he’s, and he’s home. The 
italicized words were excluded. Contrac- 
tions were separated for purposes of this 
analysis and did not nullify a repetition: 
‘He’s eating, he is eating the mouse.’ The 
italicized words were excluded. This pro- 
cedure was also followed for word rep- 
etition. 

3. Repetitions not excluded. When a 
child repeated a word for enumerative 
purposes and for starting a new ‘thought 
unit, he was not penalized for repetition: 
‘That’s a bear there, that’s a bear there, 
that’s a bear there.’ (Child was pointing as 
he spoke.) ‘I think that that is a bear.’ 
‘That’s what they are, they’re bears.’ Both 
‘that’ and ‘they’re’ were used as part of a 
new subordinate or independent clause. 
Where it could not be determined whether 
a repetition was used for emphasis or con- 
stituted a nonfluency, the repetition was 
excluded. 


B. Incompleted words. Words which 
were not completed by the child were re- 
corded as though they were completed. Ex- 
amples: ‘I th— (think) I know he’s going 
home.’ ‘He might ma—(make) build a fire.’ 
It was especially important to follow this 
procedure with regard to the repetition 
analysis. In many instances the child re- 
peated phrases but did not complete words 
within the phrase. Example: ‘I th—(think) 
he, I think he went home. This sentence 
could hardly be interpreted, as ‘I he, I think 
he went home.’ 


nN 


C. Noises. Noises were typed only when 
they were considered an integral part of the 
sentence. ‘Ahhh grrr, the lion is eating. the 
monkey.’ ‘The. lion is going to grrr the mon- 
key.’ In the first instance the noise was con- 
sidered unessential and was excluded as not 
indicative of the child’s linguistic maturity. 
In the second instance the noise clearly took 
the part of a verb and was included. 


D. Interjections. Interjections not’ con- 
sidered dictionary items and functioning 
solely to connect words or phrases were not 
included: ‘er, ‘um,’ etc. However, interjec- 
tions which served as words were included: 
‘uhhuh,’ ‘hmmm.’ The first interjection served 
as a substitute for the affirmative word ‘yes’ 
and the second as the negative word ‘no.’ 


E. Colloquialisms and neologisms. All 
colloquialisms and neologisms were included: 
‘wham,’ ‘whoops,’ ‘yike,’ ‘ya, ‘yipee,’ ‘teensy- 
weensy, ‘naw,’ ‘yeah,’ etc. 

F. Incomprehensible remarks. Some re- 
marks were incomprehensible because the 
child either turned from the microphone, 
whispered, mumbled, or uttered an inarticu- 
late word. Remarks in which all words were 
not clearly comprehensible to the examiner 
were excluded. 


Analyses Made: 


1. Number of different words. 

a. Contractions except for ‘ain’t’ were 
-divided into two words. For example, 
‘didn’t’ and ‘they’re’ were changed into ‘did 
not’ and ‘they are.’ Both examples were 
tabulated as two different words. 

b. Plurals and possessives were counted 
as separate words. 

c. The variant colloquial forms of ‘mam- 
ma’ were grouped together: ‘mom,’ ‘mom- 
my, ‘ma.’ However, the word ‘mother’ 
was listed separately. The variant colloquial 
forms of ‘papa’ were grouped together: 
‘pa, ‘pop. However, the words ‘daddy’ 
and dad were listed as separate words 
and were not included with the colloquial 
forms of ‘papa.’ The word ‘em’ was listed 
together with the word ‘them. 


. Structural analysis. 

a. Sentences beginning with the words 
‘but,’ ‘and,’ and. ‘then’ were considered 
complete. Sentences beginning with the 
conjunctions ‘cause, ‘for,’ and ‘because’ 
were considered incomplete. 

b. Compound sentences with one phrase 
were classified as elaborated and scored 
with a weight of 4. 
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Relationships Between 


Language and Nonlanguage Measures 


of Kindergarten Children 


HARRIS WINITZ 


In a previous investigation (6) boys and 
girls of kindergarten age were com- 
pared on several language and non- 
language measures. The present report 
examines the relationships among the 
major language and nonlanguage varia- 
bles for the children who served as 
subjects in that investigation. 


Procedure 


A complete account of the subjects, 
measures, and procedures used in the 
collection and analysis of the data has 
been given by Winitz (6). In the pres- 
ent investigation Pearson product- 
moment correlations were computed 
between the major language and non- 
language variables for boys, girls, and 
boys and girls combined. The Fisher 
% test was employed to determine the 
statistical significance of differences be- 
tween correlations computed separately 
for boys and girls between any two 
variables. The nonlanguage variables 
were socioeconomic status and three 


IQ’s as measured by the Wechsler In- 
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telligence Scale for Children (WISC): 
Verbal Scale IQ, Performance Scale IQ, 
Full Scale IQ. The language variables 
were verbalization measures (mean 
length of response, number of one-word 
responses, mean of the five longest re- 
sponses, number of different words, 
structural complexity score, and mean 
standard deviation), Templin Screening 
Test of Articulation score, size of 
vocabulary as measured by the Am- 
mons Full-Range Picture Vocabulary 
Test, and four fluency measures (rimes, 
child names, adult names, and thing 
names). 


Results and Discussion 


Pearson product-moment correlations 
among the major language and non- 
language variables are presented in Ta- 
bles 1, 2, and 3. A brief discussion of 
some of the correlations follows. 


Socioeconomic Status, The correla- 
tions between the scores for socio- 
economic status, based upon the Index 
of Status Characteristics (4), and 14 
language and nonlanguage variables are 
small and for the most part do not dif- 
fer significantly from zero (Table 1). 
The only correlations significant at the 
.05 level (using data for boys and girls 
combined) are between socioeconomic 
status and the WISC Verbal Scale IQ, 
Performance Scale IQ, Full Scale IQ, 
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TaBLe 1. Pearson product-moment correlations 
computed between the major language and non- 
language measures for boys (B), girls (G), and 
boys and girls combined (C). 








Socio- 
economic 
Status Verbal 


WISC Verbal IQ 
B 


WISC IQ 
Perf. Full Scale 





26* 
(G) a" 
(C) .26f 
WISC Performance IQ 
(B) .38 57 
(G) 13 42 
(C) .26 51 
WISC Full Scale IQ 
(B) 35 .90 .86 
(G) 24 .85 .83 
(C) .30 .88 85 
Length of Response 
(B) 18 .20 .16 21 
(G) 13 .09 13 12 
(C) 15 15 .16 .18 
Number of One-Word Responses 
(B) .00 —.42 -.21 -.37 
(G) -.19 -.19 -.20 -.27 
(C) -.07 -.385 -.22 -.33 
Mean of Five Longest Responses 
(B) .18 23 13 21 
(G) abl .03 .09 -.04 
(C) 12 13 .05 sa. 
Number of Different Words 
(B) .09 .29 .06 22 
(G) 23 .09 .05 .08 
(C) 15 21 .09 18 
Structural Complexity Scor 
(B) oA, .26 13 .23 
(G) .O1 oy .20 .22 
(C) .09 .23 .18 .24 
Templin Screening Test of Articulation 
(B) : 29 34 
(G) .20 34 .22 33 
(C) 12 33 sai: |S 64 
Ammons Fuli-Range Picture Vocabulary Test 
(B) .18 16°" 49 “iar 
(G) .21 35 .52 .53 
(C) .19 .61 49 .64 
Rimes 
(B) 25 oy dale be ae 
(G) 14 .02 25 .10 
(C) .20 23 .29 .29 


(Continued next column 








*Throughout table, for boys (B) and girls (G) 
separately, 7.95 (nearest tabled df=70) = .23, 
T 91=.30. 

{Throughout table, for boys and girls combined 
(C), T.95 (nearest tabled df=150) =.16, 7.9;=.21. 
**Difference between 7s, calculated for boys and 
girls separately, significant at or beyond .05 level 
as determined by Fisher 2 Test. 


Table 1 (Continued; for footnotes, see pre- 
ceding column) 











Socio-_ WISC IQ 
economic 
Status Verbal Perf. Full Scale 
Child Names 
(B) .08 21 Bit} 21 
(G) .10 .29 -.06 .14 
(C) .08 24 .07 19 
Adult Names 
(B) -.05 .27 16 24 
(G) .10 .07 212 .10 
(C) .02 .18 16 19 
Thing Names 
(B) .09 44 42 49 
(G) .14 37 24 36 
(C) 12 .41 34 43 








vocabulary size, and rimes; none of the 
correlations exceeds .30. 

Correlations of this small magnitude 
are probably the result of the homo- 
geneous social status of Iowa City fami- 
lies. The score derived from the use of 
the Index of Status Characteristics is 
based upon ratings of occupation type 
and level, source of income, house size 
and type, and area lived in. The average 
Iowa City family lives in a small or 
medium-sized house in an average resi- 


‘ dential area. A large proportion of the 


fathers are salaried. Perhaps the only 
item that might importantly discrimi- 
nate among families in lowa City would 
be the occupation of the father. If cor- 
relations were made between occupa- 
tion of father and the various language 
and nonlanguage variables, a higher 
relationship might be shown. 


Intelligence. The intercorrelations of 
the WISC IQ’s (Table 1) compare 
favorably with the intercorrelations re- 
ported by Wechsler (5) in the stand- 
ardization sample for children of 7.5 
years. (He does not report intercorre- 
lations for the 5.5-year group.) In the 
present study the correlations, using 
data for boys and girls combined, are 
.51 between Performance Scale IQ and 
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Verbal Scale IQ, .88 between Verbal 
Scale IQ and Full Scale IQ, and .85 
between Performance Scale IQ and Full 
Scale IQ; the corresponding values re- 
ported by Wechsler are .60, .90, and 
.89, respectively. In the present investi- 
gation differences between intercorre- 


lations of the WISC IQ’s calculated 
separtely for boys and girls were not 
statistically significant. 


V erbalizations, The correlations (Ta- 
ble 1) between the verbalization scores 
and various nonlanguage measures are 


TaBLe 2. Pearson product-moment correlations computed between the major language measures 
for boys (B), girls (G), and boys and girls combined (C). 











Measures 
2 3 4 5 6 7 8 9 

1. Length of Response 

(B) .69*F .55 .22 20 .00 14 .07 13 

(G) .44* .44 -.03 10 a .10 .14 15 

(C) .56** .50 10 .16 10 .14 12 15 
2. Number of Different Words 

(B) .53 sat .30 AZ -.01 .28 15 

(G) .59 .06 .03 .03 -.03 .06 .16 

(C) 57 .14 18 -10 .02 18 15 
8. Structural Complexity Score 

(B) .28 .28 .08 cae re .25 

(G) .29 .30 .01 -.10 12 .05 

(C) .29 .28 .04 .06 13 -16 
4. Templin Screening Test of Articulation 

(B) .33 .18 .23 .29 41 

(G) 24 15 .16 .08 .24 

(C) .28 .16 .20 .20 .30 
5. Ammons Full-Range Picture Vocabulary Test 

(B) .36 .18 .12 .45t 

(G) 15 .01 .13 .14 

(C) 7 .08 .12 .32 
6. Rimes 

(B) - ee .07 .25 

(G) .00 .25 15 

(C) 04 15 21 
7. Child Names 

(B) .16 .39 

(G) .34 .38 

(C) .29 .36 
8. Adult Names 

(B) .28 

(G) .06 

(C) 7 
9. Thing Names 








* Throughout table, for boys (B) and girls (G) separately, r.o5 (nearest tabled df=70) = .23, r.9)=.30. 
t Difference between rs, calculated for boys and girls separately, significant at or beyond .05 level 


as determined by Fisher 2 test. 


** Throughout table, for boys and girls combined (C), 7.95 (nearest tabled df=150) = .16, r.9,= .21. 








TaBLE 3. Pearson product-moment correlations 
computed between four verbalization measures 
for boys (B), girls (G), and boys and girls com- 
bined (C), 








Number of Mean of Five 
Measure One-Word Longest MeanS.D. 
Responses Responses 





Length of Response 


(B) -.62* 54T .87T 

(G) -.41* sok .54 

(C) -.52** 35 -68 
Number of One-Word Responses 

(B) -.51f -.38 

(G) -.13 -.28 

(C) -.31 ~.33 
Mean of Five Longest Responses 

(B) .64t 

(G) 31 

(C) 46 








* Throughout table, for boys (B) and girls (G) 
separately, 795 (nearest tabled df=70) = .28, 7,93= 
.30. 


+ Difference between rs, calculated for boys and 
girls separately, significant at or beyond .U5 level 
as determined by Fisher 2 test. 

** Throughout table, for boys and girls combined 
(C), f.95 (nearest tabled df=150) = .16, 7.9, = .21. 


not high. For example, the sexes- 
combined correlations between length 
of response and socioeconomic status 
and WISC Full Scale IQ are .15 and 
.18, respectively. The correlations (Ta- 
ble 2) between the verbalization scores 
and the other language measures also 
are low. The highest sexes-combined 
coefficient obtained was .29 (between 
structural complexity score and Tem- 
plin Screening Test of Articulation) 
and the lowest was .02 (between num- 
ber of different words and child names). 

The intercorrelations (Tables 2 and 
3) of the verbalization measures indi- 
cate moderately high relationships. 
They are, however, somewhat lower 
than those obtained by Templin (3) 
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in her study of the language skills of 
children between the ages of three and 
eight. The moderately high positive 
relationships (with the exception of the 
moderately high negative correlation 
for number of one-word responses) 
would seem to demonstrate a common 
verbal factor among the verbalization 
measures. However, the correlations 
might also be interpreted in the follow- 
ing manner: the longer the response 
a child uses, the more chance he has 
of using more different words and the 
more chance he has of using sentences 
which are more complex in structure. 

An examination of Table 3 indicates 
that children who utter long responses 
tend to use few one-word responses, 
as indicated by a —.52 sexes-combined 
correlation between length of response 
and number of one-word responses. A 
sexes-combined correlation of .68 be- 
tween length of response and mean 
standard deviation seems to indicate 
that children who use long responses 
are more variable in the length of their 
uttered responses than are children who 
use shorter responses. 


Templin Screening Test of Articula- 
tion. The sexes-combined correlations 
(Tables 1 and 2) between the Templin 
Test and the language and nonlanguage 
variables are low and often not signifi- 
cantly different from zero. Templin (3) 
reported correlations of .52 and .61 for 
five-year-old children between articu- 
lation test scores and total words used 
(comparable to length of response) and 
number of different words, respec- 
tively. Correlations of this magnitude 
were not found in the present study. 


Ammons Full-Range Picture Vocab- 
ulary Test. The sexes-combined corre- 
lations between scores on the Ammons 





, 
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Test and the Verbal Scale IQ and Full 
Scale IQ are .61 and .64, respectively. 
Ammons and Holmes (J) report a cor- 
relation of .83 between scores on their 
test and the Stanford-Binet MA, Form 
L, for children from two through five 
years. The higher correlation reported 
by Ammons and Holmes was probably 
a function of two things: (a) employ- 
ment of an IQ test different from that 
used in this study and (b) computation 
of a correlation over a range of ages 
between abilities that increase with age. 

Word-Fluency Measures. The inter- 
correlations between the four word- 
fluency measures and the correlations 
between the word-fluency measures and 
the other language and nonlanguage 
measures are, for the most part, positive 
and low. Gewirtz (2) reports correla- 
tions of a higher magnitude among the 
word-fluency measures and between the 
word-fluency measures and two verb- 
alization. measures (number of words 
and nurhber of different words). None 
of his correlations, however, exceeds .50. 

The interrelationships found among 
the various language variables were not 
of sufficient magnitude to warrant for 
predictive purposes the substitution of 
any one measure for any other. The 
relative independence of these variables 
suggests that until investigation indi- 
cates which measures are generally eval- 
uated as indicative of language retarda- 
tion, they should all (with the possible 
exception of the variability measures 
of verbalization and the word-fluency 
measures) be included as diagnostic 
tools of language development. 


Summary 


Interrelationships among several Jan- 
guage and nonlanguage variables were 
examined for 150 randomly selected, 


normal, five-year-old children (75 boys, 
75 girls). Pearson product-moment cor- 
relations were computed for boys and 
girls separately and for the sexes com- 
bined, Comparisons of the correlations 
for boys and girls indicated that few of 
the differences were statistically sig- 
nificant. Very high correlations were 
found between the WISC Performance 
Scale and Verbal Scale IQ’s and the 
Full Scale IQ. Moderately high corre- 
lations were found between the WISC 
Performance Scale IQ and the WISC 
Verbal Scale IQ; between the Ammons 
Test scores and the WISC Full Scale 
IQ; and among the various verbaliza- 
tion measures. Low correlations were 
found between socioeconomic status 
and the other variables; between the 
language measures and the Full Scale 
IQ; between the verbalization measures 
and the Ammons Test scores; between 
the Templin Screening Test of Articu- 
lation and the other variables; among 
the various word-fluency measures; and 
between the word-fluency measures 
and all of the other measures. 
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